Identification and use of antiviral compounds that inhibit interaction of host cell proteins and viral proteins required for viral replication

ABSTRACT

The present invention relates to the identification of host cell proteins that interact with viral proteins required for virus replication, and high throughput assays to identify compounds that interfere with the specific interaction between the viral and host cell protein. Interfering compounds that inhibit viral replication can be used therapeutically to treat viral infection. The invention is based, in part, on the Applicants&#39; discovery of novel interactions between viral proteins and a human host cell proteins. One of these host cell proteins, referred to herein as NPI-1, interacts with influenza virus protein NP. Also, host cell proteins, referred to herein as NS1I-1 and NS1-BP interact with influenza virus protein NS 1 . In addition, host cell proteins containing WW domains that interact with viral proteins such as Rhabdoviral M protein are described. Compounds that interfere with the binding of the host cell and viral proteins, and inhibit viral replication can be useful for treating viral infection in vivo.

The present invention is a continuation-in-part of U.S. patentapplication Ser. No. 08/444,994, filed on May 19, 1995 and is entitledto and claims priority benefit to U.S. provisional application SerialNo. 60/148,263, filed on Aug. 11, 1999, the entire contents of each ofwhich is incorporated herein by reference in its entirety.

The work reflected in this application was supported, in part, by agrant from the National Institutes of Health, and the Government mayhave certain rights in the invention.

1. INTRODUCTION

The present invention relates to the identification of new cellulartargets for viral intervention, the identification of antiviralcompounds that act on the new targets, and the therapeutic use of suchantiviral compounds.

2. BACKGROUND OF THE INVENTION

Replication of viruses can induce drastic changes in the infected hostcell metabolism. The analysis of the replication cycle of viruses bymolecular biological techniques has facilitated the identification andstudy of viral gene products that modulate and affect cellular functions(Knipe, 1996, in Fields Virology-1996, Fields, et al., eds., RavenPublishers, Philadelphia, Pa., p. 273-299).

2.1. Influenza a Viral Gene Products that Modulate host CellularFunctions

Influenza A virus is a negative strand RNA virus belonging to theorthomyxovinis family. The genome of the virus consists of 8 segmentsand encodes 10 polypeptides. Experimental evidence generated in thelaboratory of Scholtissek indicates that the nucleoprotein (NP) is amajor determinant of species specificity of influenza viruses(Scholtissek, et al., 1985, Virology 147: 287-294).

2.1.1. NP PROTEIN

Transcription and replication of influenza virus RNA takes place in thenucleus of the infected cell. Transcription and replication of influenzavirus RNA requires four virus encoded proteins: the NP and the threecomponents of the viral RNA-dependent RNA polymerase, PB1, PB2 and PA(Huang, et al., 1990, J. Virol. 64: 5669-5673). The NP is the majorstructural component of the virion that interacts with genomic RNA, andis required for antitermination during RNA synthesis (Beaton & Krug,1986, Proc. Natl. Acad. Sci. USA 83:6282-6286). NP is also required forelongation of RNA chains (Shapiro & Krug, 1988, J. Virol. 62: 2285-2290)but not for initiation (Honda, et al., 1988, J. Biochem. 104:1021-1026).

Phylogenetic analysis divides NP genes into two families: one containingNPs predominantly of avian origin, and one containing those of humanorigin (Bean, 1984, Virology 133: 438-442; Buckler-White & Murphy, 1986,Virology 155: 345-355; Gammelin, et al., 1989, Virology 170: 71-80;Scholtissek, et al., 1985, Virology 147: 287-294). The human virusA/HK/1/68 and viruses having genetically related NPs cannot rescuemutants of the avian virus A/FPV/Rostock/1/34 (FPV) with temperaturesensitive (ts) defects in the NP following double infection of chickenembryo fibroblasts (CEF) at 40° C. (Scholtissek, et al., 1985, Virology147: 287-294; Scholtissek, et al., 1978, Virology 91: 79-85). However,the human viruses that failed to rescue the ts mutants on CEF cells wereable to do so on Madin-Darby canine kidney (MDCK) cells (Scholtissek, etal., 1978, Virology 91: 79-85). Additionally, A/HK/1/68 virus andA/FPV/Rostock/1/34 virus reassortants containing the A/HK/1/68virus-derived NP replicate in MDBK cells (bovine kidney) but not in CEFs(Scholtissek, et al., 1978, Virology 91: 79-85). The host-specificrescue of FPV ts mutants and the host restriction of A/HK/1/68 virusreassortants suggest that a factor(s) of host origin, which differsbetween mammalian and avian cells, is responsible for this phenomenon,and that this factor may interact with the influenza A virus NP.However, no host protein(s) that interacts with NP during infection haspreviously been identified or characterized.

2.1.2. NS1 PROTEIN

The NS1 protein of influenza A viruses is known to modulate and affectcellular functions. The NS1 is the only non-structural protein of thevirus and is abundantly expressed in infected cells (Lazarowitz, et al.,1971, Virology 46: 830-843).

Several regulatory finctions of the NS1 protein have been identified.The NS1 protein may influence multiple steps of gene expressionincluding pre-mRNA splicing (Fortes, et al., 1994, EMBO J. 13: 704-712;Lu, et al., 1994, Genes Dev. 8: 1817-1828), nucleo-cytoplasmic transportof poly(A)-RNA (Fortes, et al., 1994, EMBO J. 13: 704-712; Qiu, Y., etal., 1994, J. Virology 68: 2425-2432) and translation (De La Luna, S.,et al., 1995, J. Virol. 69: 2427-2433; Enami, K., et al., 1994, J.Virol. 68: 1423-1427). In addition, NS1 can block the activation of thedouble-stranded RNA (dsRNA) activated protein kinase (PKR), presumablydue to its dsRNA binding activity (Lu, et al., 1995, Virology 214:222-228). The activation of PKR results in a downregulation oftranslation and is part of the cellular antiviral defense mechanisms TheNS1 protein may counteract this cellular response in order to synthesizehigh levels of viral proteins in the infected cell (Lu, et al., 1995,Virology 214: 222-228). These pleiotropic effects may singly or combinedprovide the molecular basis for the role that the NS 1 protein plays indetermining the host range and virulence of influenza virus strains(Shimizu K., et al., 1983, Virology 124: 35-44; Treanor, J. J., etal.,1989, Virology 171: 1-9).

Despite these studies examining the activities of NS1 and itsinteractions with various RNAs, little is known about the cellularfactors that are recognized by the NS1 protein and that may therefore becentral to NS1 functions.

2.2. RHABDOVIRUS GENE PRODUCTS THAT MODULATE HOST CELLULAR FUNCTIONS

Viruses belonging to the Rhabdoviridae family cause disease in a widevariety of species including vertebrates, invertebrates, and plants(Wagner & Rose, 1996, In Fields, et al., (eds.), Fields Virology, 3rdedition, Lippincott-Raven Publishers, Philadelphia, pp. 1121-1135). Twoprototypic members of the Rhabdoviridae family include vesicularstomatitis virus (VSV; genus=vesiculovirus) and rabies virus(genus=lyssavirus). Like influenza A virus, rhabdoviruses possess anegative-strand RNA genome. Rhabdoviruses replicate exclusively in thecytoplasm of infected cells, and derive their lipid envelope via buddingthrough the cytoplasmic membrane (for review see Wagner & Rose, 1996, InFields, et al., (eds.), Fundamental Virology, 3rd edition,Lippincott-Raven Publishers, Philadelphia, pp. 1121-1135).

2.2.1. MATRIX (M) PROTEIN

Many aspects of the replication process of rhabdoviruses remain unclear.The major structural protein of rhabdoviruses, the matrix (M) protein,is thought to play a key role in viral assembly and release (Chong &Rose, 1993, J. Virol., 67, 407-414; Chong & Rose, 1994, J. Virol., 68,441-447; Kaptur, et al., 1995, Virology, 206, 894-903; Lenard, 1996,Virology, 216, 289-298; Lyles, et al., 1992, J. Virol., 66, 349-358;McCreedy & Lyles, 1989, Virus Res., 14, 189-205; Mebatsion, et al.,1996, Cell, 84, 941-951; Pal & Wagner, 1987, In, Wagner (ed.), TheRhabdoviruses. Plenum, New York, pp. 75-128; Newcomb, et al., 1982, J.Virol., 41, 1055-1062; Zakowski, et al., 1981, Biochemistry, 20,3902-3907). When the M protein of VSV is expressed in mammalian cells ora baculovirus system in the absence of any other viral protein, Mprotein is released from the cells in the form of lipid vesicles bybudding through the cytoplasmic membrane (Justice, et al., 1995, J.Virol., 69, 3156-3160; Li, et al., 1993, J. Virol., 67, 4415-4420). TheN-terminal portion of the VSV M protein has been shown to be importantfor membrane localization, and thus the budding process (Chong & Rose,1993, J. Virol., 67, 407-414; Chong and Rose, 1994, J. Virol., 68,441-447; Lenard and Vanderoef, 1990, J. Virol., 64, 3486-3491; Ye, etal., 1994, J. Virol. 68, 7386-7396; Zakowski and Wagner, 1980, J.Virol., 36, 93-102). The precise mechanism of how M is released fromcells and the potential function(s) of host proteins in the buddingprocess remain unclear.

The role of the M protein in rhabdoviral assembly has been compared tothat of the gag protein in retroviral assembly (Lenard, 1996, Virology,216, 289-298). The gag protein of Rous sarcoma virus (RSV) and the Mprotein of VSV share the ability to associate with the cytoplasmicmembrane, and to bud from cells independent of other viral proteins(Justice, et al., 1995, J. Virol., 69, 3156-3160; Li, et al., 1993, J.Virol., 67, 4415-4420; Wills, et al., 1994, J. Virol., 68, 6605-6618).In addition to the membrane association (MA) domain of RSV gag, a late(L) budding domain has been identified in the p2b protein of RSV gag andshown to play an essential role in the late stage of budding (Wills, etal., 1994, J. Virol., 68, 6605-6618).

Interestingly, a sequence in the RSV L domain (PPPY) matches thesequence of the consensus motif required for interacting with WW domainsof cellular proteins (Chen and Sudol, 1996, Techniques in ProteinChemistry VII, 7, 3-12; Chen, et al., 1997, J. Biol. Chem., 272,17070-17077; Macias, et al., 1996, Nature, 382, 646-649; Sudol, et al.,1995, J. Biol. Chem., 270, 14733-14741). While L domains have beenidentified in the gag proteins of other retroviruses, only the gagproteins of the oncoviruses appear to have the PPXY motif conserved(Gottlinger, et al., 1991, Proc. Natl. Acad. Sci. USA., 88, 3195-3199;Huang, et al., 1995, J. Virol., 69, 6810-6818; Parent, et al., 1995, J.Virol., 69, 5455-5460; Puffer, et al., 1997, J. Virol., 71, 6541-6546;Wills, et al., 1994, J. Virol., 68, 6605-6618). The recently describedWW domain is (i) a highly structured, modular domain that mediatesprotein-protein interactions, (ii) present in a wide range of cellularproteins with unrelated functions, and (iii) functionally similar to,but structurally distinct from, Src homology-3 (SH3) domains (for reviewsee Sudol, 1996, In Blundell, et al., (eds.) Prog Biophys. Molec. Biol.,Vol. 65, Elsevier Science Ltd., Great Britain, pp. 113-132). The biologyof the WW domain and its interacting ligands have been implicated inplaying a role in a number of disease states including Liddle's syndrome(a genetic form of hypertension), muscular dystrophy, and Alzheimer'sdisease (Bork and Sudol, 1994, Trends Biochem. Sci., 19, 531-533;Einbond and Sudol, 1996, FEBS Lett., 384, 1-8; Staub, et al., 1996, EMBOJ., 15, 2371-2380; Sudol, 1996, In Blundell, et al., (eds.) Prog.Biophys. Molec. Biol., Vol. 65, Elsevier Science Ltd., Great Britain,pp. 113-132). In addition, the WW domain has also been implicated in thebiology of retroviral budding and assembly (Gamier, et al., 1996,Nature, 381, 744-745; Sudol, 1996, In Blundell, et al., (eds.) Prog.Biophys. Molec. Biol., Vol. 65, Elsevier Science L td., Great Britain,pp. 113-132). Indeed, the L domain of RSV gag mentioned above has beenshown recently to interact with the WW domain of the cellular Yes-kinaseassociated protein (YAP) (Garnier, et al., 1996, Nature, 381, 744-745;Sudol, 1994, Oncogene, 9, 2145-2152).

Thus, little is known about host cell functions that contribute to theintracellular replication of negative-strand RNA viruses such asinfluenza and rhabdoviruses. No cellular factors, or interactionsbetween cellular factors and viral proteins, have been previouslycharacterized that can be used as targets for therapeutic intervention.

3. SUMMARY OF THE INVENTION

The present invention relates to the identification of host cellproteins that interact with viral proteins required for virusreplication, and high throughput assays to identify compounds thatinterfere with the specific interaction between the viral and host cellprotein. Interfering compounds that inhibit viral replication can beused therapeutically to treat viral infection.

The invention is based, in part, on the Applicants' discovery of novelinteractions between viral proteins such as NP and NS1 nfluenzaproteins, the rhabdovirus M protein, and human host cell proteins orprotein domains referred to herein as NPI-1, NPI-2, NPI-3, NPI-4, NPI-5,NPI-6, NS1I-1, NS1-BP, and cellular proteins containing WW domains,respectively. Host cell protein s such as NPI-1 and NS1I-1 may beaccessory proteins required for replication of the viruses. Compoundsthat interfere with the binding of viral proteins with host cellproteins or protein domains, and that inhibit viral replication, can beuseful for treating viral infection in vivo.

4. DESCRIPTION OF THE FIGURES

FIGS. 1A and 1B: The interactive trap system, as used in theidentification of NP- and NS1-interacting proteins. FIG. 1A: R100contains the reporter gene LexAop-LEU2 and a transcriptionally inactiveLexA-NP fusion protein (left). Libra ry proteins are synthesized in R100transformants in media containing galactose. If the library protein doesnot interact with the LexA-NP fusion protein, then the LEU2 gene is nottranscribed (middle). If the library protein does interact with theLexA-NP fusion protein, then the LEU2 gene is transcriptionally active,and the cell forms a colony on leu⁻ medium (right). FIG. 1B: ThepLexA-NP bait plasmid used in the interactive trap. The coding region ofinfluenza A/PR/8/34 virus nucleoprotein (NP) was subcloned into theEcoRI and Sal I restriction sites of pEG202. This construction encodes afusion protein which includes 202 amino acids of LexA and the entirecoding region of NP (498 amino acids) separated by 3 amino acids encodedby polylinker sequences derived from the cloning process. The nucleotidesequence and amino acid sequence depicted correspond to SEQ ID NO: 1 andSEQ ID NO:2, respectively.

FIGS. 2A-2D: Nucleotide sequence of NPI-1 cDNA (SEQ ID NO:3)and theencoded polypeptide sequence (SEQ ID NO:4).

FIGS. 3A-3B: Comparison of the amino acid sequence of NPI-1 (SEQ IDNO:4) and the amino acid sequence of SRP1 (SEQ ID NO:5). Vertical linesindicate identity; colons and periods indicate conservative changes(Deveraux, et al., 1984, Nucl. Acids Res. 12: 387-395). 42 amino acidARM repeats are aligned vertically according to Peifer, et al., 1994,Cell 76: 789-791. For a complete comparison of SRP1 to other ARM repeatcontaining proteins, see Peifer, et al., 1994, Cell 76: 789-791. The ARMconsensus sequence is indicated at the bottom; “+” indicates K,R, or H;“−” indicates D or E; “˜” indicates a gap. Since other residues areconserved within the repeats of NPI-1 and SRP1, a consensus sequencederived from only these two proteins is also shown.

FIG. 4: GST-NPI-1 binds to NP in vitro. GST (lanes 1, 5, 6) andGST-NPI-1 (lanes 2, 3, 7, 8) were expressed in bacteria and precipitatedfrom cell lysates on glutathione agarose beads. The complexed beads werethen incubated with partially purified influenza virus NP and polymerasepreparations (Pol/NP) as indicated. Precipitated proteins werefractionated on a 12.5% SDS polyacrylamide gel, and either stained withCoomassie blue (lanes 1 to 3), or immunoblotted using the monoclonalantibody HT103 directed against the viral nucleoprotein (lanes 4 to 8).Unprecipitated Pol/NP was separated in lane 4. M, protein molecularweight markers; *, GST-NPI-1 fusion protein; arrows indicate majorfusion protein degradation products.

FIG. 5: Immunoblot of total cellular proteins using polyclonal rabbitsera against NPI-1. Total cell lysates and cytoplasmic cell extractsfrom HeLa and MDBK cell lines were separated by SDS-PAGE, transferred tonitrocellulose, immunoblotted with anti-NPI-1 sera, and developed by¹²⁵I-protein A. Each lane contains protein from 1×10⁵ cells.

FIG. 6: NP is co-immunoprecipitated from influenza A virus infectedcells by antisera against NPI-1. Infected HeLa cell proteins werelabeled with ³⁵S-methionine and ³⁵S-cysteine, and total cell lysateswere made as described in the text. Complexes of NPI-1 and NP wereprecipitated using anti-NPI-1 sera. Precipitated proteins were thenfractionated by SDS-PAGE and detected by autoradiography.

FIGS. 7-11: Partial DNA sequences of isolated coding regions of fivedifferent proteins that interact withthe NP of influenza A, as detectedusing the interactive trap system in yeast. The proteins whose sequencesare provided are as follows:

FIG. 7: Partial nucleotide sequence of NPI-2 (SEQ ID NO:6).

FIGS. 8A-8C: Partial nucleotide sequence of NPI-3 (SEQ ID NO:7) and theamino acid sequence encoded by the partial nucleotide sequence (SEQ IDNO:8).

FIG. 9: Partial nucleotide sequence of NPI-4 (SEQ ID NO:9).

FIG. 10: Partial nucleotide sequence of NPI-5 (SEQ ID NO:10).

FIG. 11: Partial nucleotide sequence of NPI-6 (SEQ ID NO:11).

FIGS. 12A-12D: Nucleotide sequence of the NS1I-1 gene (SEQ ID NO:12) andthe encoded amino acid sequence of the NS1I-1 protein (SEQ ID NO:13).The sequence of 2572 bp was determined by dideoxy sequencing of twooverlapping clones. The first clone, pK5, was isolated from the yeastlibrary and contains the HeLa cell cDNA comprising nucleotide positions791 to 2572. The second clone, pRACENS1I-1, resulted from the 5′RACEprocedure used to obtain cDNA derived from the 5′-end of NS1I-1 mRNA,and comprises nucleotide positions 1 to 944.

FIG. 13: Northern blot analysis of HeLa cell poly(A)-RNA using anNS1I-1-specific probe.

FIG. 14: Co-precipitation of NS1 protein from extracts ofA/WSN/33-infected MDCK cells by GST-NS1I-1 and glutathione sepharose.Monolayers of MDCK cells were either infected with influenza A/WSN/33virus at an m.o.i. of 10 or mock-infected, and labeled with³⁵S-methionine and cysteine from 5 to 6 hours p.i. Proteins wereextracted and used for binding to glutathione sepharose coated withGST-NS1I-1 (lanes 3 and 8) or GST-protein (lane 6). As controls,extracts were immunoprecipitated with α-NS1 (lane 2), α-M1 (lane 4), ornon-immune serum (lane 5). Proteins were analyzed by SDS gelelectrophoresis and fluorography. Aliquots of the total extractscorresponding to 10 used for the glutathione agarose precipitations areshown (lanes 1 and 7). The positions of virus proteins and molecularweight markers are indicated to the left.

FIGS. 15A-15E: GST-NS1I-1 co-precipitates NS1 proteins of influenza Aand B virus strains. Extracts of ³⁵S-labeled MDCK cells infected withthe influenza viruses A/duck/Alberta/76 (FIG. 15A), A/turkey/Oregon(FIG. 15B), A/Beijing/32/92 (FIG. 15C), A/Berkeley/1/68 (FIG. 15D), andB/Lee/40 (FIG. 15E) were prepared and used in precipitations of viralproteins by glutathione-sepharose coated with GST-NS1I-1 (lanes“GST-K5”) or GST-protein (lanes “GST”) as described in FIG. 14. Inaddition, viral proteins were immunoprecipitated using α-NS1-, α-M1- ornon-immune serum (lanes “α-NS1”, “α-M1”, “NI”, respectively). Analysiswas by SDS gel electrophoresis and fluorography. Aliquots of the totalextracts corresponding to 10 (FIGS. 15C and 15E) or 6.7 (FIGS. 15A, 15B,and 15D), respectively, are also shown (lanes “T”). The positions ofviral proteins are indicated to the right.

FIGS. 16A-16C: Nucleotide sequence of NS1-BP cDNA (SEQ ID NO: 14) andderived amino acid sequence (SEQ ID NO: 15). The sequence of 2752nucleotides was determined by sequencing of two overlapping clones.Nucleotides at positions +1038 to +2215 are derived from the HeLacDNA-insert of the library plasmid p59-1. The 5′-end of the library cDNAis indicated by an arrow. Nucleotides −537 to +1037 were determined bysequencing cloned HeLa cDNA that was generated by 5′RACE. The openreading frame of 619 amino acids spans positions +1 to +1857. Thededuced amino acid sequence is given in single letter codes. The stopcodon is marked by an asterisk.

FIG. 17: Northern blot analysis of poly(A)-selected HeLa cell RNA withan NS1-BP-specific probe. 1 μg of HeLa cell poly(A)-RNA was separated byformaldehyde-agarose gel electrophoresis and immobilized on a nylonmembrane. A ³²P-labeled probe derived from p59-1 was used to detect NS1-BP mRNA by hybridization. RNA size markers are indicated to the left.

FIG. 18: Alignment of the five tandem repeat elements of NS1-BP (SEQ IDNOS:16-20). The PILEUP program of the Genetics Computer Group (GCG) wasused to align repetitive sequences between amino acids 368 and 607 ofNS1-BP. The PRETTY program of GCG was used to determine a consensussequence (SEQ ID NO:2 1). Conserved amino acids are shown in capitalletters. Invariant positions are indicated by bold letters.

FIG. 19: Precipitation of ³⁵S-labeled NS1 protein by GST-NS1 -BP fusionprotein. Radiolabeled NS1 protein was synthesized in coupledtranscription/translation reactions in the presence of ³⁵S-methionineusing pcDNA3-NS1 as a template. The NS1 protein was precipitated byglutathione Sepharose coated with GST (lane GST) or GST-NS1 -BP, whichcarries amino acids 347 to 619 of NS1-BP (lane GST-NS1-BP). Theprecipitates were analyzed by SDS polyacrylamide gel electrophoresis andautoradiography. A 10 aliquot of the total reaction was separated inparallel (T). The positions of molecular weight markers are indicated tothe left.

FIG. 20: Immunoblot analysis of NS1-BP. Confluent monolayers of Hep-2,293 and HeLa cells were lysed in RIPA buffer. Soluble proteins fromequivalent volumes of extract corresponding to 5×10⁴ cells wereseparated by SDS gel electrophoresis, transferred to nitrocellulosemembrane and probed with affinity-purified NS1-BP-specific antibodies.The positions of marker proteins are indicated to the left.

FIG. 21: Intracellular localization of NS1-BP as determined by indirectimmunofluorescence analysis of HeLa cells. Subconfluent HeLa cells werefixed and stained with affinity-purified NS1-BP-specific rabbitantibodies followed by visualization using HTC-conjugated secondaryantibodies.

FIGS. 22A-22I: Intracellular distribution of the cellular NS1-BP, theSC35 protein and the viral NS1 protein in non-infected or influenza Avirus infected HeLa cells. Confocal micrographs show non-infected (FIGS.22A-22C) or influenza A/WSN/33 virus-infected HeLa cells at 10 hrs. p.i.(FIGS. 22D-22I). The intranuclear localization of NS1-BP was visualizedby staining with NS1-BP-specific primary rabbit antibodies andFITC-conjugated secondary antibodies (FIGS. 22A, 22D, 22G). The cellularSC35 protein (FIGS. 22B and 22H) and the viral NS1 protein (FIG. 22E)were labeled by monoclonal mouse antibodies and visualized by TexasRed-conjugated anti-mouse IgG. Micrographs in the third column (FIGS.22C, 22F and 22I) show confocal overlays of the FITC- and TexasRed-signals from the fields to the left.

FIGS. 23A and 23B: Pre-mRNA splicing but not spliceosome assembly isinhibited by truncated NS1-BP. ³²P-labeled MINX pre-mRNA was incubatedin HeLa cell nuclear extract under splicing-compatible conditions in theabsence (lanes NE) or presence of 80 ng/μl affinity-purified GST (lanesGST) or equimolar amounts of affinity purified GST-NS1 (lanes GST-NS1)or GST-NS1-BP fusion protein which carries amino acids 347 to 619 ofNS1-BP (lanes GST-NS1-BP). (A) RNA analysis. RNA was purified fromaliquots of the reactions after a one (lanes 1) or two hour incubationperiod (lanes 2) and analyzed by electrophoresis on denaturing 13polyacrylamide-urea gels. The positions of the pre-mRNA, theintron-exon2 and exonl intermediates, and the spliced mRNA and lariatproducts are indicated to the right. The lower part of the figure showsa longer exposure of the gel. M, ³²P-labeled size marker DNA fragments.(B) Splicing complex analysis. Aliquots of the splicing reactions weretaken after 20, 40 and 60 minutes and heparin was added to a finalconcentration of 1 mg/ml. The samples were analyzed by electrophoresison a native agarose-polyacrylamide gel. The positions of the H-, A- andB-type splicing complexes (Konarska, M. M., et al., 1987. Cell 49:763-774) are indicated on the left.

FIGS. 24A, 24B and 24C: Far-western analysis of VSV-infected cellextracts and VSV virions. A) Coomassie brilliant blue stain of VSVvirion proteins (G=glycoprotein, N=nucleoprotein, and M=matrix protein)present within the pellet (P) and soluble (S) fractions. B) Identicalnitrocellulose filters containing ¹⁴C-labeled protein standards (MW),mock-infected cell extracts (m), VSV (Ind.)-infected cell extracts (V),and the soluble fraction of purified VSV virions (VR). Filters wereprobed with either gst alone (left filter), or gstYAPWW1 (right filter).Cell extracts were prepared in RIPA buffer with 0.1 SDS. C) A longerexposure of lanes (m) and (V) from the right filter in FIG. 1B.

FIG. 25: Diagram of fusion proteins consisting of gst fused to theN-terminal 74 amino acids of VSV (Ind.) M protein. gstVSVM74WT containsa wild type PY motif. gstVSVM74P-A contains a single point mutationresulting in a proline (P) to alanine (A) change. gstVSVM74Y-A containsa single point mutation resulting in a tyrosine (Y) to alanine (A)change.

FIGS. 26A and 26B: Far-western analysis of gstVSVM74WT. A) Coomassiebrilliant blue stain of bacterial cell extracts expressing gstVSVM74WT(indicated by the arrow) when grown under inducing (IN) conditions, andnot expressing gstVSVM74WT when grown under noninducing (U) conditions.B) Three nitrocellulose filters containing the amount of protein seen inpanel A, and probed withgstYAPWW2 (lanes 1, 2, and 3), gstYAPWW1 (lanes4 and 5), and gst alone (lanes 6 and 7). M=¹⁴C-labeled proteinstandards.

FIGS. 27A and 27B: Far-western analysis of wild type and mutant gst-VSVMfusion proteins. A) Coomassie brilliant blue stain of bacterial cellextracts expressing gst alone (lane 2, short arrow), gstVSVM74WT (lane3), gstVSVM74Y-A (lane 4), and gstVSVM74P-A (lane 5). Lane 1 represents¹⁴C-labeled protein standards (MW). The wild type fusion proteinmigrates slightly slower than the mutant proteins due to the presence ofeight additional amino acids encoded by the polylinker region of thisconstruct. B) Nitrocellulose filter containing identical amounts ofprotein as shown in panel A, and probed with gstYAPWW2. The intensity ofthe signals observed for gstVSVM74Y-A (lane 2) and gstVSVM74P-A (lane 3)is approximately 10 of that observed for gstVSVM74WT (lane 1).

FIG. 28: Diagram of fusion proteins consisting of gst fused to aminoacids 1-202 (full-length), 1-69, 1-52 (wild type), or 1-52 (Y-A;tyrosine to alanine mutation) of the rabies virus M protein. Thedesignation of each of the fusion proteins is listed on the left. PPEYis the wild type sequence (E-glutamic acid).

FIGS. 29A, 29B and 29C: Far-western analysis of gst-rabies M fusionproteins. A) Duplicate nitrocellulose filters containing gstRabM69(M1-69; lanes 1 and 3) and gstRabM202 (M202; lanes 2 and 4) were probedwith either gst alone (lanes 1 and 2), or gstYAPWW2 (lanes 3 and 4). B)Coomassie brilliant blue stain of bacterial cell extracts expressinggstRabM52Y-A (Y-A; lane 1) or gstRabM52WT (WT; lane 2) indicated by thearrowhead. C) Identical amounts of gstRabM52Y-A and gstRabM52WT as seenin panel B were immobilized onto duplicate nitrocellulose filters andprobed with either gst alone (lanes 1 and 2), or gstYAPWW2 (lanes 3 and4).

FIGS. 30A, 30B and 30C: Far-western analysis of gst-rabies M fusionproteins and a panel of gst-WW domain fusion proteins. Identical amountsof gst, gstYAPWW1 (YAP1), gstYAPWW2 (YAP2), gstDystrophinWW (DYST.),gstNEDD4WW2 (NEDD2), gstNEDD4WW3 (NEDD3) and gstFE65WW (FE65) wereimmobilized onto nitrocellulose and probed with either gstRabM52WT(Panel A), or gstRabM52Y-A (Panel B). MW=¹⁴C-labeled protein standards.C) Coomassie brilliant blue stain of the gst-WW domain fusion proteinsthat were present on the filters in panels A and B.

FIG. 31A: VSV M budding assay. Radiolabeled lysates from CV-1 cellsreceiving no DNA (mock, lane 2), T7VSVMWT DNA (MWT, lane 3), andT7VSVMY-A DNA (MY-A, lane 4) were immunoprecipitated with polyclonalantiserum against the M protein of VSV and fractioned by SDS-PAGE. Theposition of the M protein of VSV is indicated. MW=¹⁴C-labeled proteinstandards.

FIG. 31B: Radiolabeled proteins released into the media covering cellstransfected with no DNA (mock, lane 1), T7VSVMWT DNA (lane 2), andT7VSVMY-A DNA (lane 3) were immunoprecipitated with polyclonal antiserumagainst the M protein of VSV and fractionated by SDS-PAGE. The relativemigration of the M protein of VSV is indicated.

FIGS. 32A-32C: Indirect immunofluorescence and confocal microscopy oftransfected CV-1 cells. A) CV-1 cells expressing wild type VSV M proteinat 8 hours post-transfection. B) CV-1 cells expressing the VSV M proteincontaining a tyrosine (Y) to alanine (A) mutation within the PY motif at8 hours post-transfection. C) untransfected CV-1 cells. Primarypolyclonal antiserum (identical to that used in the experiment shown inFIG. 8) was directed against the M protein of VSV.

5. DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to the identification of host cellularproteins that interact with viral proteins important to viralreplication and infection; the identification of compounds thatinterfere with the specific interaction of the host cell and viralproteins; and the evaluation and use of such compounds as antivirals inthe treatment of viral infections in animals, including humans.

The invention is described in this section and in the examples, below,for the identification and inhibition of interactions between human hostcell proteins and viral proteins, including influenza virus andrhabdovirus proteins. For clarity of discussion, particular detail isprovided for the isolation of five particular host cell proteins and/orprotein domains. The first such host cell protein is nucleoproteininteractor 1 (NPI-1), a human cell protein that interacts with theinfluenza virus NP protein. The NPI-1 gene and protein, and theprotein's interaction with NP protein are described in detail in theexample in Section 6, below. Other host cell proteins that interact withthe NP protein include, but are not limited to, NPI-2, NPI-3, NPI-4,NPI-5, and NPI-6, and are also described, below. The interactionsbetween NP and the NPI-1 through NPI-6 host cell proteins provide noveltargets for antiviral treatment and serve as excellent models fordetailing the aspects of the invention. However, the principles may beanalogously applied to the identification of other host cell proteinsthat interact with any of the four influenza virus proteins (PA, PB1,PB2, in addition to NP) required for viral RNA replication.

Particular detail is also provided in the examples in Sections 7 and 8,below, for the identification of human cell proteins that interact withthe influenza virus NS1 protein. Section 7 describes the identificationof nonstructural protein 1 interactor 1 (NS1I-1), a human cell proteinthat interacts with NS1. Section 8 describes the identification ofnonstructural protein 1 binding protein (NS1-BP), a human cell proteinthat also interacts with NS1. These interactions provide novel targetsfor antiviral treatment.

As set forth in the example in Section 9, the present invention alsocontemplates identifying interactions between rhabdovirus proteins andhost cell proteins. Section 9 describes the identification of cellularWW-domains as sites of interaction with the rhabdovirus matrix (M)protein. In a particular embodiment, the host cell protein is not a cellsurface receptor protein.

The present invention further contemplates identifying interactionsbetween host cell proteins and other viral proteins required forinfection, including but not limited to, in the case of influenza virus,NEP, HA, NA, M₁, and M₂ proteins, or, in the case of rhabdoviruses, Mprotein, for example.

The principles may also be analogously applied to other RNA viruses bothpositive and negative stranded RNA viruses, both single and doublestranded, including but not limited to single-stranded, positive-senseRNA viruses such as picomaviruses, caliciviruses, astroviruses,togaviruses, flaviviruses, coronaviruses, arteriviruses, andretroviruses; single-stranded, negative-sense RNA viruses such asparamyxoviruses (including parainfluenza viruses, measles virus, mumpsvirus, and respiratory syncytial virus), rhabdoviruses, filoviruses,orthomyxoviruses, orthomyxo-like viruses including Dhori insect virus,bunyaviruses, and arenaviruses; and double-stranded, positive-sense RNAviruses such as reoviruses and birnaviruses, as well as humanimmunodeficiency virus (HIV), members of the herpes virus family, andadenoviruses. The host cell proteins so identified may includecompletely novel proteins, or previously described proteins that havenot yet been shown to interact with viral proteins.

Any method suitable for detecting protein-protein interactions may beemployed for identifying novel viral-host protein interactions, and areconsidered within the scope of the present invention. For example, sometraditional methods are co-immunoprecipitation, crosslinking andcopurification through gradients or chromatographic colurnns. Newermethods result in the simultaneous identification of the genes codingfor the protein interacting with a target protein. These methods includeprobing expression libraries with labeled target protein in a mannersimilar to antibody probing of λgt 11 libraries.

One such method that detects protein interactions in vivo, the yeastinteractive trap system, was successfully used as described herein toidentify the host cell proteins NPI-1 through NPI-6, NS1I-1, and NS1-BPdescribed herein, and is described in detail for illustration only andnot by way of limitation.

The host cell/viral protein interactions identified are consideredtargets for antiviral intervention. Assays, such as the ones describedherein, can be used to identify compounds that interfere with suchinteractions. The compounds so identified that inhibit virus infection,replication, assembly, or release can be used as antivirals. Inaccordance with the invention, a given compound found to inhibit onevirus may be tested for antiviral activity against a wide range ofdifferent viruses that have analogous dependencies on host cellproteins, including but not limited to single-stranded, positive-senseRNA viruses such as picornaviruses, caliciviruses, astroviruses,togaviruses, flaviviruses, coronaviruses, arteriviruses, andretroviruses; single-stranded negative-sense RNA viruses such asparamyxoviruses (including parainfluenza viruses, measles virus, mumpsvirus, and respiratory syncytial virus), rhabdoviruses, filoviruses,orthomyxoviruses, orthomyxo-like viruses including Dhori insect virus,bunyaviruses, and arenaviruses; and double-stranded, positive-sense RNAviruses such as reoviruses and birnaviruses, as well as humanimmunodeficiency virus (HIV), members of the herpes virus family, andadenoviruses.

Elucidation of the roles of the interacting proteins will lead toidentifying other viruses as targets for intervention. For example, asdescribed herein below, NPI-1 is important to the import of viralnucleic acid-protein complex into the nucleus of the host cell.Therefore, methods described below that disrupt this process, throughinterfering with the activity of NPI-1, for example, may be effective intreating viruses with nuclear phases. Such additional viruses include,but are not limited to, human immunodeficiency virus (HIV), members ofthe herpes virus family, and adenoviruses.

The various aspects of the invention are described in the subsectionsbelow with specific reference to host cell proteins that interact withthe influenza viral proteins NP (host cell proteins NPI-1 through NPI-6)and NS1 (host cell proteins NS1I-1 and NS1-BP), and rhabdoviralM-protein (host cell proteins containing WW domains), with particularemphasis on host cell protein NPI-1; however, the invention is notlimited to NPI-1 and encompasses any viral/host cell proteininteractions as targets for therapeutic intervention.

5.1. IDENTIFICATION OF HOST CELL PROTEINS THAT INTERACT WITH VIRALPROTEINS REQUIRED FOR REPLICATION

The gene for the host cell protein NPI-1 was cloned based on its abilityto interact with the influenza A virus NP. The NPI-1 is the humanhomolog of the yeast protein SRP1. Interaction of NPI-1 and NP wasdemonstrated in yeast by the interactive trap system; in vitrocoprecipitation of the NP with a bacterially expressed NPI-1 protein;and in infected cell extracts by coprecipitation of the NP with NPI-1,using anti-NPI-1 sera. The demonstration of this previously unknowninteraction is illustrated in the working examples (see Section 6,infra). The data generated indicate that NPI-1 plays a role in thereplication of influenza A viruses. NPI-1 is the first cellular proteincharacterized that interacts with a protein encoded by influenzaviruses. This role, therefore, makes the inhibition of the NP-NPI-1interaction an excellent target for antiviral therapy. It has not yetbeen demonstrated at what stage in the replication cycle NPI-1functions. The NPI-1 could affect any of a number of NP functions whichmay include: (1) movement of the ribonucleoprotein complex (RNP) to thenucleus; (2) vRNA synthesis, including antiterrnination and elongation;(3) mRNA synthesis, including elongation, polyadenylation, and transportto the cytoplasm; and (4) exit of the RNP from the nucleus during virionassembly.

The fact that both NPI-1 and SRP1 interact with proteins involved in RNAsynthesis implies that there may be fundamental similarities betweencellular DNA-dependent transcription and influenza viral RNA-dependentRNA synthesis. Cellular factors, like NPI-1, may be shared by the viraland the cellular RNA synthesis machinery to perform similar functions.In addition, the NPI-1 may tether the viral RNP to areas of the nuclearmatrix where splicing and polyadenylation of MRNA occur. It should benoted that although NPI-1 was isolated from HeLa cells, this cell lineis not productively infected by influenza A virus. However, HeLa cellssynthesize influenza viral RNAs and proteins (see FIG. 6, lane 3), andhave previously been used to examine viral RNA synthesis (Beaton & Krug,1986, Proc. Natl. Acad. Sci. USA 83:6282-6286).

The viral NP exists in two forms in the infected cell. One form isassociated with ribonucleoprotein complexes (RNP), and the other is afree form (Shapiro & Krug, 1988, J. Virol. 62: 2285-2290). Pol/NPpreparations used in coprecipitation experiments with NPI-1 werepurified over cesium chloride/glycerol gradients (Honda, et al., 1988,J. Biochem. 104: 1021-1026), which dissociate and purify virion proteinsaway from vRNA. As shown herein, (FIG. 4, lane 3) the NP but not thepolymerase proteins were detected oil Coomassie stained gels; however,coprecipitation of the viral polymerase proteins was not rigorouslytested by immunoblot experiments. Only the NP was coprecipitated frominfected HeLa cell extracts (FIG. 6) suggesting that it is free NP thatis bound by NPI-1.

Only one host factor has been assigned a definitive function in thereplication process of a negative strand RNA virus. The cellular caseinkinase II has been shown to phosphorylate the phosphoprotein P of thevesicular stomatitis-virus (VSV) RNA-dependent RNA polymerase. This is astep that appears to be required-in order to activate the viralpolymerase (Barik and Banerjee, 1992, Proc. Natl. Acad. Sci. USA 89:6570-6574; Barik and Banerjee, 1992, J. Virol. 66: 1109-1118).

NPI-1 and SRP 1 are 50% identical and 81% conserved at the amino acidlevel. This is as a very high degree of conservation between proteinsbelonging to organisms as distantly related as humans and yeast, andsuggests that the NPI-1/SRP1 performs a very basic function in the cell.NPI-1 and SRP1 have eight internal repeats, each of approximately42-amino acids (FIG. 3). This repeat, termed the ARM motif, wasoriginally identified in the Drosophila segment polarity gene armadillo(Riggleman, et al., 1989, Genes Dev. 3: 96-113), and it has beenidentified in a number of other proteins including β-catenin,plakoglobin, p120, APC and smGDS (Peifer, et al., 1994, Cell 76:789-791, and references therein). Several ARM proteins are associatedwith cell adhesion structures. Armadillo and its homologues bind to theC-terminal cytoplasmic tail of cadherins, a calcium-dependent class ofcell adhesion molecules (CAMs), linking the CAMs to the underlyingcytoskeleton at cell-cell junctions (McCrea, et al., 1991, Science 254:1359-1361). In contrast to the armadillo protein, SRP1 and NPI-1 appearto be localized to the nucleus. If NPI-1, like SRP1 (Yano, et al., 1992,Mol. Cell. Biol. 12: 5640-5651), is associated with the nuclearmembrane, it is possible that NPI-1 functions to tether viral RNP to thenuclear membranes (Jackson, et al., 1982, Nature 296: 366-368). Itshould be noted that NPI-1 may be related to (or identical with) anuclear protein that has been found to be involved in V(D)J ecombination(Cuomo, et al., 1994, Meeting abstract F015, Keystone Symposium onRecombination).

The carboxyl terminal 265 amino acids of the NPI-1, which weresufficient for interaction with the viral NP, contain four and one-halfARM repeats. Individual repeats, in general, are approximately 30identical with the ARM consensus sequence. This is consistent with thedegree of conservation in ARM repeats of other proteins (Peifer, et al.,1994, Cell 76: 789-791).

Using the same interactive trap system in yeast, five additional DNAsequences, NPI-2, NPI-3, NPI-4, NPI-5, and NPI-6 were isolated thatpartially encode proteins that interact with the NP of influenza Avirus.

Also, using the yeast interactive trap system, DNA sequences encodingthe NS1I-1 protein and the NS1-BP were identified based on theinteraction between these proteins and the NS1 protein of influenza Avirus. NS1I-1 protein is the human homolog of porcine 17β-estradioldehydrogenase. Several proteins with a dehydrogenase function haverecently been shown to be involved in post-transcriptional events ofgene expression (Hentze, 1994, Trends Biochem. Sci. 19: 101-103). Thissupports an important functional role for the NS1I-1 interaction duringthe viral life cycle.

Using oligonucleotide primers and standard PCR protocols to introducepoint mutations within selected protein motifs of the rhabdovirus Mprotein, followed by a budding assay, host cellular proteins containingWW domains that interact with viral M protein were identified (seeSection 9).

The various proteins so identified are listed in Table I.

TABLE I INTERACTING HOST CELL PROTEINS Host Cell Proteins FIG. #Comments NPI-1 FIG. 2 Protein sequence, homologous to SRPI of yeastNPI-2 FIG. 7 Identical to sequences of hnRNP C proteins (Lahiri andThomas, 1986, Nucl. Acids Res. 14: 4077-4094) NPI-3 FIG. 8 NPI-4 FIG. 9NPI-5 FIG. 10 NPI-6 FIG. 11 NS1I-1 FIG. 12. Protein sequence, homologousto porcine 17β-estradiol dehydrogenase NS1-BP FIG. 16 WW domains (SeeSection 8) Note: Subsequent to Applicants' identification of NPI-3,NPI-4, and NPI-5, these sequences were described by other groups anddesignated Rch1, PC4, and BAT1, respectively.

The coding sequence for NPI-2 is identical to sequences coding for thepreviously identified hnRNP C proteins (Lahiri and Thomas, 1986, Nucl.Acids Res. 14: 4077-4094). The NPI-3, NPI-4, NPI-5, and NPI-6 codingsequences were unknown prior to their identification by Applicant. Thecoding sequences for NS1I-1 and NS1-BP are described in detail in theexample in Sections 7 and 8, below. The WW domain was previouslycharacterized by Sudol, 1996, In Blundell, et al., (eds.), Prog.Biophys. Molec. Biol., Vol. 65, Elsevier Science Ltd., Great Britain,pp. 113-132, as detailed in Section 9, below.

The specific binding domains of the viral and host cell protein bindingpartners can be used in accordance with the invention as targets fortherapeutic intervention or screening assays to identify inhibitorycompounds. In addition, peptides corresponding to the binding site ofthe viral or host cell protein can be used as antiviral inhibitors bydisrupting the binding between the binding partners. For example,peptides having amino acid sequences within the viral NP-NLS domain canbe used to disrupt the interaction between NP and NPI-1, and therebyinhibit viral replication, as detailed in the Example in Section 10,below.

In one embodiment, such an inhibitory peptide comprises a 19 amino-acidsequence of an NP-NLS oligopeptide (Ala Ser Gln Gly Thr Lys Arg Ser TyrGlu Gln Met Glu Thr Asp Gly Glu Arg Gln (SEQ ID NO:22); corresponding toamino acids 2-20 of NP). In another embodiment, the protein of theinvention comprises amino acids 1-20 of NP. In another embodiment, theprotein of the invention comprises a subsequence from 8-18 amino acidsin length of the Ala Ser Gln Gly Thr Lys Arg Ser Tyr Glu Gln Met Glu ThrAsp Gly Glu Arg Gln (SEQ ID NO:22) 19-mer of the NP-NLS oligopeptide.Preferably, the protein of the invention comprises an 8 amino-acidsequence Thr Lys Arg Ser Xaa Xaa Xaa Met (SEQ ID NO:23), whichcorresponds to the conserved domain of an NP-NLS peptide. The inventioncontemplates a protein comprising the Ala Ser Gln Gly Thr Lys Arg SerTyr Glu Gln Met Glu Thr Asp Gly Glu Arg Gln (SEQ ID NO:22) 19-mer of theNP-NLS oligopeptide, or an 8-18 amino acid subsequence thereof, in whichone or more of the following amino acids substitutions are made at thefollowing positions:

Position 2: Ser→Thr

Position 3: Gln→Lys

Position 8: Ser→Pro

Position 10: Glu→Gly

Position 15: Asp→Gly

Position 17: Glu→Asp.

The invention also contemplates a protein of the invention comprising asubsequence of the Ala Ser Gln Gly Thr Lys Arg Ser Tyr Glu Gln Met GluThr Asp Gly Glu Arg Gln (SEQ ID NO:22) 19-mer of the NP-NLS oligopeptidein which position 19 (amino acid Gin) is deleted.

The invention contemplates, in addition to the DNA sequences disclosedherein, 1) any DNA sequence that encodes the same amino acid sequence asencoded by the DNA sequences shown in FIGS. 2, 7-12 and 16 any DNAsequence that hybridizes to the complement of the coding sequencesdisclosed herein (see FIGS. 2, 7-12, and 16) under highly stringentconditions, e.g., hybridization to filter-bound DNA in 0.5 M NaHPO₄, 7%sodium dodecyl sulfate (SDS), 1 mM EDTA at 65° C., and washing in0.1×SSC/0.1 SDS at 68° C. (Ausubel, F. M., et al., eds., 1989, CurrentProtocols in Molecular Biology, Vol. 1, Green Publishing Associates,Inc., and John Wiley and Sons, Inc., New York, at p. 2.10.3) and encodesa finctionally equivalent gene product; and/or 3) any DNA sequence thathybridizes to the complement of the coding sequences disclosed herein(see FIGS. 2, 7-12, and 16) under less stringent conditions, such asmoderately stringent conditions, e.g., washing in 0.2×SSC/0.1 SDS at 42°C. (Ausubel, F. M., et al., eds., 1989, Current Protocols in MolecularBiology, Vol. I, Green Publishing Associates, Inc., and John Wiley andSons, Inc., New York, at p. 2.10.3), yet that still encodes afunctionally equivalent gene product.

The invention also encompasses 1) DNA vectors that contain any of thecoding sequences disclosed herein (see FIGS. 2, 7-12, and 16), and/ortheir complements (i.e., antisense); 2) DNA expression vectors thatcontain any of the coding sequences disclosed herein (see FIGS. 2, 7-12,and 16), and/or their complements (i.e., antisense), operativelyassociated with a regulatory element that directs the expression of thecoding and/or antisense sequences; and 3) genetically engineered hostcells that contain any of the coding sequences disclosed herein (seeFIGS. 2, 7-12, and 16), and/or, their complements (ie., antisense),operatively associated with a regulatory element that directs theexpression of the coding and/or antisense sequences in the host cell.Regulatory elements include but are not limited to inducible andnon-inducible promoters, enhancers, operators and other elements knownto those skilled in the art that drive and regulate expression. Theinvention includes fragments of any of the DNA sequences disclosedherein.

Once the host cell proteins are obtained, they can be used to detectinteractions with proteins from other viruses, in accordance with theinvention. The following descriptions are provided to illustrate thisapproach and not by way of limitation. Influenza B virusribonucleoprotein complex was isolated and using a Western immunoblotassay, it was found that the cellular NPI-1 was associated with thiscomplex. This result indicates that NPI-1, isolated based on itsinteraction with influenza A virus NP, also interacts with influenza Bvirus NP. Thus, compounds that inhibit NP-NPI-1 interactions ininfluenza A virus and thereby inhibit influenza A viral infection shouldbe similarly effective as antivirals against influenza B virus.

Host cell genes that are homologous to those identified herein may beobtained by several methods. In some cases, different host cell proteinsthat share the property of interacting with the same viral protein, e.g.influenza A virus NP, may also share genetic homology. Thus, the genesidentified through the interactive trap selection may be homologous toone another.

Once a host cell gene is identified in accordance with the invention,any homologous gene may be obtained using cloning methods well known tothose skilled in the art, including but not limited to the use ofappropriate probes to detect the homologous genes within an appropriatecDNA or gDNA (genomic DNA) library. (See, for example, Sambrook, et al.,1989, Molecular Cloning: A Laboratory Manual, Cold Spring HarborLaboratories, which is incorporated by reference herein in itsentirety.) This method is especially useful for obtaining proteins thatmay not share the property of binding to the same viral protein, but maynonetheless be genetically homologous.

Such homologous proteins may interact with proteins of viruses otherthan the virus used in the interactive trap. For example, a host cellgene whose product was detected through its interaction with aninfluenza A viral protein may be homologous to another gene whoseproduct does not interact with influenza A virus, but which doesinteract with influenza B viral protein. To optimize the detection ofsuch a homologous gene, cDNA libraries may be constructed from cellsinfected with a virus of interest. Besides influenza B virus, thisprocedure may be applied analogously to other viruses as well, includingbut not limited to single-stranded, positive-sense RNA viruses such aspicomaviruses, caliciviruses, astroviruses, togaviruses,: flaviviruses,coronaviruses, arteriviruses, and retroviruses; single-stranded,negative-sense RNA viruses such as paramyxoviruses (includingparainfluenza viruses, measles virus, mumps virus, and respiratorysyncytial virus), rhabdoviruses, filoviruses, orthomyxoviruses,orthomyxo-like viruses including Dhori insect virus, bunyaviruses, andarenaviruses; and double-stranded, positive-sense RNA viruses such asreoviruses and birnaviruses, as well as human immunodeficiency virus(HIV), members of the herpes virus family, and adenoviruses.

5.2. SCREENING ASSAYS FOR COMPOUNDS THAT INTERFERE WITH THE INTERACTIONOF HOST CELL AND VIRAL PROTEINS REQUIRED FOR VIRAL REPLICATION

The host cell protein and the viral protein that interact and bind aresometimes referred to herein as “binding partners”.This term alsoincludes peptide fragments, produced as described in the subsectionsbelow, comprising the binding domain of each respective protein. Any ofa number of assay systems may be utilized. to test compounds for theirability to interfere with the interaction of the binding partners.However, rapid high throughput assays for screening large numbers ofcompounds, including but not limited to ligands (natural or synthetic),peptides, or small organic molecules are preferred. Compounds that areso identified to interfere with the interaction of the binding partnersshould be further evaluated for antiviral activity in cell based assays,animal model systems and in patients as described herein.

The basic principle of the assay systems used to identify compounds thatinterfere with the interaction between the viral and host cell proteinsinvolves preparing a reaction mixture containing the viral protein andthe host cell protein under conditions and for a time sufficient toallow the two proteins to interact and bind, thus forming a complex. Inorder to test a compound for inhibitory activity, the reaction isconducted in the presence and absence of the test compound, i.e., thetest compound may be initially included in the reaction mixture, oradded at a time subsequent to the addition of the viral and host cellprotein; controls are incubated without the test compound or with aplacebo. The formation of any complexes between the viral protein andthe host cell protein is then detected. The formation of a complex inthe control reaction, but not in the reaction mixture containing thetest compound indicates that the compound interferes with theinteraction of the viral protein and host cell protein.

The following two descriptions of assay systems used to identifycompounds that interfere with the interaction between the viral and hostcell proteins are given by way of illustration and not by way oflimitation. In a first illustration, NP-NLS oligopeptide consisting ofamino acids 2-10 of the NLS peptide was used to inhibit viral growth athousand-fold in cultured MDBK cells. Thus, peptides that correspond tothe NP-NLS target site of binding with NPI-1 can inhibit the interactionof NP with NPI-1 and can be used to treat influenza infection. In asecond illustration, NP-NLS was found to bind NPI-1 at a domain of NPI-1corresponding to amino acids 425-538 of NPI-1. These two illustrationsdemonstrate that each peptide binding partner (NP, NPI-1, NPI-2) orfragment thereof can be used for screening for compounds that interactwith a binding domain and thereby interfere with or disrupt the bindinginteraction of NP with NPI-1 and/or NPI-3. Each peptide binding partner(NP, NPI-1, NPI-2) or fragment thereof can also be used directly tocompetitively interfere with or disrupt the binding interaction of NPwith NPI-1 and/or NPI-3 and therefore, to treat influenza infection.

In one embodiment, an NP-NLS peptide is used to disrupt the bindinginteraction of NP with NPI-1 and/or NPI-3. Preferably an amino acidsequence ranging from 8-20 amino acids of the N-terminus of NP is used.Alternatively a 19-mer amino acid sequence (Ala Ser Gln Gly Thr Lys ArgSer Tyr Glu Gln Met Glu Thr Asp Gly Glu Arg Gln (SEQ ID NO:22))corresponding to amino acids 2-20 of the NP protein is used. In oneembodiment, the peptide used to disrupt the binding interaction of NPwith NPI-1 and/or NPI-1 comprises a 19 amino-acid sequence of an NP-NLSoligopeptide (Ala Ser Gin Gly Thr Lys Arg Ser Tyr Glu Gln Met Glu ThrAsp Gly Glu Arg Gln (SEQ ID NO:22); corresponding to amino acids 2-20 ofNP). In another embodiment, the peptide used to disrupt the bindinginteraction comprises amino acids 1-20 of NP. In another embodiment, thepeptide used to disrupt the binding interaction comprises a subsequencefrom 8-18 amino acids long of the Ala Ser Gln Gly Thr Lys Arg Ser TyrGlu Gln Met Glu Thr Asp Gly Glu Arg Gln (SEQ ID NO:22) 19-mer of theNP-NLS oligopeptide. Preferably, the peptide used to disrupt the bindinginteraction comprises an 8 amino-acid sequence Thr Lys Arg Ser Xaa XaaXaa Met (SEQ ID NO:23), which corresponds to the conserved domain of anNP-NLS peptide. The invention contemplates a peptide that is used todisrupt the binding interaction comprising the Ala Ser Gln Gly Thr LysArg Ser Tyr Glu Gln Met Glu Thr Asp Gly Glu Arg Gln (SEQ ID NO:22)19-mer of the NP-NLS oligopeptide, or an 8-18 amino acid subsequencethereof, in which one or more of the following amino acids substitutionsare made at the following positions:

Position 2: Ser→Thr

Position 3: Gln→Lys

Position 8: Ser→Pro

Position 10: Glu→Gly

Position 15: Asp→Gly

Position 17: Glu→Asp.

The invention also contemplates a peptide that is used to disrupt thebinding interaction comprising a subsequence of the Ala Ser Gln Gly ThrLys Arg Ser Tyr Glu Gln Met Glu Thr Asp Gly Glu Arg Gln 19-mer of theNP-NLS oligopeptide in which position 19 (amino acid Gln) is deleted.

The assay components and various formats that may be utilized aredescribed in the subsections below.

5.2.1. ASSAY COMPONENTS

5.2.1.1. PEPTIDES

The host cell protein and viral protein binding partners used ascomponents in the assay may be derived from natural sources, e.g.,purified from cells and virus, respectively, using protein separationtechniques well known in the art; produced by recombinant DNA technologyusing techniques known in the art (see e.g., Sambrook, et al., 1989,Molecular Cloning: A Laboratory Manual, Cold Spring Harbor LaboratoriesPress, Cold Spring Harbor, N.Y.); and/or chemically synthesized in wholeor in part using techniques known in the art; e.g., peptides can besynthesized by solid phase techniques, cleaved from the resin andpurified by preparative high performance liquid chromatography (see,e.g., Creighton, 1983, Proteins: Structures and Molecular Principles, W.H. Freeman & Co., N.Y., pp. 50-60). The composition of the syntheticpeptides may be confirmed by amino acid analysis or sequencing; e.g.,using the Edman degradation procedure (see e.g., Creighton, 1983,Proteins: Structures and Molecular Principles, W.H. Freeman & Co., N.Y.,pp. 34-49).

The peptide fragments should be produced to correspond to the bindingdomains of the respective proteins. Any number of methods routinelypracticed in the art can be used to identify and isolate the protein'sbinding site. These methods include but are not limited to mutagenesisof one of the genes encoding the protein and screening for disruption ofbinding in a co-immunoprecipitation assay, or mutagenesis of the hostcell gene and selecting for resistance to viral infection. Compensatingmutations in the viral gene can be selected that allow for viral grovthin this mutant host. Sequence analysis of the genes encoding therespective proteins will reveal the mutations that correspond to theregion of the protein involved in interactive binding. Alternatively,one protein can be anchored to a solid surface using methods describedin section 5.2.2. infra, and allowed to interact with and bind to itslabeled binding partner, which has been treated with a proteolyticenzyme, such as trypsin. After washing, a short, labeled peptidecomprising the binding domain may remain associated with the solidmaterial, which can be isolated and identified by amino acid sequencing.Also, once the gene for the protein is obtained, short gene segments canbe engineered to express peptide fragments of the protein, which canthen be tested for binding activity and purified or synthesized.

Whether produced by molecular cloning methods or by chemical syntheticmethods, the amino acid sequence of the binding partners which may beused in the assays of the invention need not be identical to thereported sequence of the genes encoding them. The binding partners maycomprise altered sequences in which amino acid residues are deleted,added, or substituted resulting in a functionally equivalent product.

For example, functionally equivalent amino acid residues may besubstituted for residues within the sequence resulting in a change ofsequence. Such substitutes may be selected from other members of theclass to which the amino acid belongs; e.g., the nonpolar (hydrophobic)amino acids include alanine, leucine, isoleucine, valine, proline,phenylalanine, tryptophan, and methionine; the polar neutral amino acidsinclude glycine, serine, threonine, cysteine, tyrosine, asparagine, andglutamine; the positively charged (basic) amino acids include arginine,lysine, and histidine; the negatively charged (acidic) amino acidsinclude aspartic and glutamic acid.

One of the binding partners used in the assay system should be labeled,either directly or indirectly, to facilitate detection of a complexformed between the viral and host cell proteins. Any of a variety ofsuitable labeling systems may be used including but not limited toradioisotopes such as ¹²⁵I, ¹³¹I, ³⁵S or ³H; enzyme labeling systemsthat generate a detectable colorimetric signal or light when exposed tosubstrate such as horseradish peroxidase, alkaline phosphatase,beta-galactosidase, and luciferase; and fluorescent labels such asumbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine,dichlorotriazinylamine fluorescein, dansyl chloride and phycoerythrin.

5.2.1.2. LABELING, IMMOBILIZATION AND DETECTION OF PEPTIDES AND PROTEINS

Where recombinant DNA technology is used to produce the viral and hostcell binding partners of the assay it may be advantageous to engineerfusion proteins that can facilitate labeling, immobilization and/ordetection. For example, the coding sequence of the viral or host cellprotein can be fused to that of a heterologous protein that has enzymeactivity or serves as an enzyme substrate in order to facilitatelabeling and detection. The fusion constructs should be designed so thatthe heterologous component of the fusion product does not interfere withbinding of the host cell and viral protein.

5.2.1.3. LABELING WITH ANTIBODIES

Indirect labeling involves the use of a third protein, such as a labeledantibody, which specifically binds to one of the binding partners, ie.,either the host cell protein or viral protein used. The term “antibody”or “antibodies” as used herein refers to immunoglobulin molecules andimmunologically active portions of immunoglobulin molecules, i.e.,molecules that contain an antigen binding site which specifically bindsan antigen, such as to one of the binding partners, and does not bindother antigens in a sample. Antibodies include, but are not limited to,monoclonal antibodies, polyclonal antibodies, multispecific antibodies,human antibodies, humanized antibodies, chimeric antibodies, singlechain Fv (scFv), single chain antibodies, anti-idiotypic (anti-Id)antibodies, F(ab) fragments, F(ab′)₂ fragments, and epitope-bindingfragments of any of the above.

The antibodies may be generated by any suitable method known in the art.Polyclonal antibodies to an antigen-of-interest can be produced byvarious procedures well known in the art. For example, a polypeptide ofthe invention can be administered to various host animals including, butnot limited to, rabbits, mice, rats, etc. to induce the production ofsera containing polyclonal antibodies specific for the antigen. Variousadjuvants may be used to increase the immunological response, dependingon the host species, and include but are not limited to, Freund's(complete and incomplete), mineral gels such as aluminum hydroxide,surface active substances such as lysolecithin, pluronic polyols,polyanions, peptides, oil emulsions, keyhole limpet hemocyanins,dinitrophenol, and potentially useful human adjuvants such as BCG(bacille Calmette-Guerin) and corynebacterium parvum. Such adjuvants arealso well known in the art.

Monoclonal antibodies can be prepared using a wide variety oftechniquesknown in the art including the use of hybridoma, recombinant, and phagedisplay technologies, or a combination thereof. For example, monoclonalantibodies can be produced using hybridoma techniques including thoseknown in the art and taught, for example, in Harlow et al., Antibodies:A Laboratory Manual, (Cold Spring Harbor Laboratory Press, 2nd ed.1988); Hammerling, et al., in: Monoclonal Antibodies and T-CellHybridomas 563-681 (Elsevier, N.Y., 1981) (said references incorporatedby reference in their entireties). The term “monoclonal antibody” asused herein is not limited to antibodies produced through hybridomatechnology. The term “monoclonal antibody” refers to an antibody that isderived from a single clone, including any eukaryotic, prokaryotic, orphage clone, and not the method by which it is produced.

Methods for producing and screening for specific antibodies usinghybridoma technology are routine and well known in the art. Briefly,mice can be immunized with a polypeptide of the invention or a cellexpressing such peptide. Once an immune response is detected, e.g.,antibodies specific for the antigen are detected in the mouse serum, themouse spleen is harvested and splenocytes isolated. The splenocytes arethen fused by well known techniques to any suitable myeloma cells, forexample cells from cell line SP20 available from the ATCC. Hybridomasare selected and cloned by limited dilution. The hybridoma clones arethen assayed by methods known in the art for cells that secreteantibodies capable of binding a polypeptide of the invention. Ascitesfluid, which generally contains high levels of antibodies, can begenerated by immunizing mice with positive hybridoma clones.

Antibody fragments which recognize specific epitopes may be generated byknown techniques. For example, Fab and F(ab′)2 fragments of theinvention may be produced by proteolytic cleavage of immunoglobulinmolecules, using enzymes such as papain (to produce Fab fragments) orpepsin (to produce F(ab′)2 fragments). F(ab′)2 fragments contain thevariable region, the light chain constant region and the CH1 domain ofthe heavy chain. Antibodies may also be generated using various phagedisplay methods known in the art. In phage display methods, functionalantibody domains are displayed on the surface of a phage particles whichcarry the polynucleotide sequences encoding them. In a particular, suchphage can be utilized to display antigen binding domains expressed froma repertoire or combinatorial antibody library (e.g., human or murine).Phage expressing an antigen binding domain that binds the antigen ofinterest can be selected or identified with antigen, e.g., using labeledantigen or antigen bound or captured to a solid surface or bead. Phageused in these methods are typically filamentous phage including fd andM13 binding domains expressed from phage with Fab, Fv or disulfidestabilized Fv antibody domains recombinantly fused to either the phagegene III or gene VIII protein. Examples of phage display methods thatcan be used to make the antibodies of the present invention includethose disclosed in Brinkman et al., J. Immunol Methods 182:41-50(1995);Ames et al., J. Immunol. Methods 184:177-186(1995); Kettleborough etal., Eur. J. Immunol. 24:952-958 (1994); Persic et al., Gene 187 9-18(1997); Burton et al., Advances in Immunology 57:191-280 (1994); PCTapplication No. PCT/GB91/01134; PCT publications WO 90/02809; WO91/10737; WO 92/01047; WO 92/18619; WO 93/11236; WO 95/15982; WO95/20401; and U.S. Pat. Nos. 5,698,426; 5,223,409; 5,403,484; 5,580,717;5,427,908; 5,750,753; 5,821,047; 5,571,698; 5,427,908; 5,516,637;5,780,225; 5,658,727; 5,733,743 and 5,969,108; each of which isincorporated herein by reference; in its entirety.

Chimeric antibodies can be produced using any technique known to thoseof skill in the art. See e.g., Morrison, Science 229:1202 (1985); Oi etal., BioTechniques 4:214 (1986); Gillies et al., (1989) J. Immunol.Methods 125:191-202; U.S. Pat. Nos. 5,807,715; 4,816,567; and 4,816,397,which are incorporated herein by reference in their entirety. A chimericantibody is a molecule in which different portions of the antibody arederived from different animal species, such as antibodies having avariable region derived from a murine monoclonal antibody and a humanimmunoglobulin constant region.

Human antibodies can be produced using any technique known to those ofskill in the art. Humanized antibodies are antibody molecules fromnon-human species antibody that binds the desired antigen having one ormore complementarity determining regions (CDRs) from the non-humanspecies and a framework regions from a human immunoglobulin molecule.Often, framework residues in the human framework regions will besubstituted with the corresponding residue from the CDR donor antibodyto alter, preferably improve, antigen binding. These frameworksubstitutions are identified by methods well known in the art, e.g., bymodeling of the interactions of the CDR and framework residues toidentify framework residues important for antigen binding and sequencecomparison to identify unusual framework residues at particularpositions. (See, e.g., Queen et al., U.S. Pat. No. 5,585,089; Riechmannet al., Nature 332:323 (1988), which are incorporated herein byreference in their entireties.) Antibodies can be humanized using avariety of techniques known in the art including, for example,CDR-grafting (EP 239,400; PCT publication WO 91/09967; U.S. Pat. Nos.5,225,539; 5,530,101; and 5,585,089), veneering or resurfacing (EP592,106; EP 519,596; Padlan, Molecular Immunology 28(4/5):489-498(1991); Studnicka etal., Protein Engineering 7(6):805-814 (1994);Roguska. et al., PNAS 91:969-973 (1994)), and chain shuffling (U.S. Pat.No. 5,565,332).

Human antibodies can be made by a variety ofmethods known in the artincluding phage display methods described above using antibody librariesderived from human immunoglobulin sequences. See also, U.S. Pat. Nos.4,444,887 and 4,716,111; and PCT publications WO 98/46645, WO 98/50433,WO 98/24893, WO 98/16654, WO 96/34096, WO 96/33735, and WO 91/10741;each of which is incorporated herein by reference in its entirety.

Human antibodies can also be produced using transgenic mice which areincapable of expressing functional endogenous immunoglobulins, but whichcan express human immunoglobulin genes. For example, the human heavy andlight chain immunoglobulin gene complexes may be introduced randomly orby homologous recombination into mouse embryonic stem cells.Alternatively, the human variable region, constant region, and diversityregion may be introduced into mouse embryonic stem cells in addition tothe human heavy and light chain genes. The mouse heavy and light chainimmunoglobulin genes may be rendered non-functional separately orsimultaneously with the introduction of human immunoglobulin loci byhomologous recombination. In particular, homozygous deletion of theJ_(H) region prevents endogenous antibody production. The modifiedembryonic stem cells are expanded and microinjected into blastocysts toproduce chimeric mice. The chimeric mice are then be bred to producehomozygous offspring which express human antibodies. The transgenic miceare immunized in the normal fashion with a selected antigen, e.g., allor a portion of a polypeptide of the invention. Monoclonal antibodiesdirected against the antigen can be obtained from the immunized,transgenic mice using conventional hybridoma technology. The humanimmunoglobulin transgenes harbored by the transgenic mice rearrangeduring B cell differentiation, and subsequently undergo class switchingand somatic mutation. For an overview of this technology for producinghuman antibodies, see Lonberg and Huszar (1995, Int. Rev. Immunol.13:65-93). For a detailed discussion ofthis technology for producinghuman antibodies and human monoclonal antibodies and protocols forproducing such antibodies, see, e.g, PCT publications WO 98/24893; WO96/34096; WO 96/33735; U.S. Pat. Nos. 5,413,923; 5,625,126; 5,633,425;5,569,825; 5,661,016; 5,545,806; 5,814,318; and 5,939,598, which areincorporated by reference herein in their entirety. In addition,companies such as Abgenix, Inc. (Freemont, Calif.) and Genpharm (SanJose, Calif.) can be engaged to provide human antibodies directedagainst a selected antigen using technology similar to that describedabove.

5.2.2. ASSAY FORMATS

The assay can be conducted in a heterogeneous or homogeneous format.Heterogeneous assays involve anchoring one of the binding partners ontoa solid phase and detecting complexes anchored on the solid phase at theend of the reaction. In homogeneous assays, the entire reaction iscarried out in a liquid phase. In either approach, the order of additionof reactants can be varied to obtain different information about thecompounds being tested. For example, test compounds that interfere withthe interaction between the binding partners, e.g., by competition, canbe identified by conducting the reaction in the presence of the testsubstance; i.e., by adding the test substance to the reaction mixtureprior to or simultaneously with the viral protein and host cell protein.On the other hand, test compounds that disrupt preformed complexes, e.g.compounds with higher binding constants that displace one of the bindingpartners from the complex, can be tested by adding the test compound tothe reaction mixture after complexes have been formed. The variousformats are described briefly below.

In a heterogeneous assay system, one binding partner, e.g, either theviral protein or the host cell protein, is anchored onto a solidsurface, and its binding partner, which is not anchored, is labeled,either directly or indirectly. In practice, microtiter plates areconveniently utilized. The anchored species may be immobilized bynon-covalent or covalent attachments. Non-covalent attachment may beaccomplished simply by coating the solid surface with a solution of theprotein and drying. Alternatively, an immobilized antibody specific forthe protein may be used to anchor the protein to the solid surface. Thesurfaces may be prepared in advance and stored.

In order to conduct the assay, the binding partner of the immobilizedspecies is added to the coated surface with or without the testcompound. After the reaction is complete, unreacted components areremoved (e.g., by washing) and any complexes formed will remainimmobilized on the solid surface. The detection of complexes anchored onthe solid surface can be accomplished in a number of ways. Where thebinding partner was pre-labeled, the detection of label immobilized onthe surface indicates that complexes were formed. Where the bindingpartner is not pre-labeled, an indirect label can be used to detectcomplexes anchored on the surface; e.g., using a labeled antibodyspecific for the binding partner (the antibody, in turn, may be directlylabeled or indirectly labeled with a labeled anti-Ig antibody).Depending upon the order of addition of reaction components, testcompounds that inhibit complex formation or that disrupt preformedcomplexes can be detected.

Alternatively, the reaction can be conducted in a liquid phase in thepresence or absence of the test compound, the reaction productsseparated from unreacted components, and complexes detected; e.g., usingan immobilized antibody specific for one binding partner to anchor anycomplexes formed in solution, and a labeled antibody specific for theother binding partner to detect anchored complexes. Again, dependingupon the order of addition of reactants to the liquid phase, testcompounds that inhibit complex or which disrupt preformed complexes canbe identified.

In an alternate embodiment of the invention, a homogeneous assay can beused. In this approach, a preformed complex of the host cell and viralprotein is prepared in which one of the binding partners is labeled, butthe signal generated by the label is quenched due to complex formation(see, e.g., U.S. Pat. No. 4,109,496 by Rubenstein which utilizes thisapproach for immunoassays). The addition of a test substance thatcompetes with and displaces one of the binding partners from thepreformed complex will reset in the generation of a signal abovebackground. In this way, test substances that disrupt the viralprotein-host cell protein interaction can be identified.

For example, in a particular embodiment for NPI-1, NPI-1 can be preparedfor immobilization using recombinant DNA techniques described in section5.2.1., supra. Its coding region can be fused to theglutathione-S-transferase (GST) gene using the fusion vector pGEX-5X-1,in such a manner that its binding activity is maintained in theresulting fusion protein. NP can be purified and used to raise amonoclonal antibody, specific for NP, using methods routinely practicedin the art and described above. This antibody can be labeled with theradioactive isotope ¹²⁵I, for example, by methods routinely practiced inthe art. In a heterogeneous assay, e.g., the GST-NPI-1 fusion proteincan be anchored to glutathione-agarose beads. NP can then be added inthe presence or absence of the test compound in a manner that allows NPto interact with and bind to the NPI-1 portion of the fusion protein.After the test compound is added, unbound material can be washed away,and the NP-specific labeled monoclonal antibody can be added to thesystem and allowed to bind to the complexed binding partners. Theinteraction between NP and NPI-1 can be detected by measuring the amountof radioactivity that remains associated with the glutathione-agarosebeads. A successful inhibition of the interaction by the test compoundwill result in a decrease in measured radioactivity.

Alternatively, the viral binding partner, such as NS1, can be preparedfor immobilization using recombinant DNA techniques described in section5.2.1., supra. The NS1 coding region can be fused to theglutathione-S-transferase (GST) gene using a fusion vector such aspGEX-5X-1, in such a manner that its binding activity is maintained inthe resulting fusion protein. A corresponding host cell binding partner,such as NS1-BP can be purified and used to raise an NS1-BP-specificmonoclonal antibody which can be labeled with a radioactive isotope,using methods described below in section 5.2.1.3. In a heterogeneousassay, e.g., the GST-NS1 fusion protein can be anchored toglutathione-agarose beads. NSI-BP can then be added in the presence orabsence of the test compound in a manner that allows NS1-BP to interactwith and bind to the NS1 portion of the fusion protein. After the testcompound is added, unbound material can be washed away, and theNS1-specific labeled monoclonal antibody can be added to the system andallowed to bind to the complexed binding partners. The interactionbetween NS1 and NS1-BP can be detected by measuring the amount ofradioactivity that remains associated with the glutathione-agarosebeads. A successful inhibition of the interaction by the test compoundwill result in a decrease in measured radioactivity.

In another particular embodiment, for NS1-BP, for example, a GST-NS1-BPfusion protein and NS1 (or conversely, a GST-NS1 fusion protein andNS1-BP) can be mixed together in liquid in the absence of the solidglutathione-agarose beads. The test compound can be added either duringor after the binding partners are allowed to interact. This mixture canthen be added to the glutathione-agarose beads and unbound material iswashed away. Again the extent of inhibition of the binding partnerinteraction can be detected by measuring the radioactivity associatedwith the beads.

In another embodiment of the invention, these same techniques can beemployed using peptide fragments that correspond to the binding domainsof the viral and host cell proteins, respectively, in place of one orboth of the full length proteins. These binding domains can beidentified, as described in section 5.2.1., supra. For example, and notby way of limitation, NS1-BP can be anchored to a solid material asdescribed above in this section by making a GST-NS1-BP fusion proteinand allowing it to bind to glutathione agarose beads. NS1 can be labeledwith a radioactive isotope, such as ³⁵S, and cleaved with a proteolyticenzyme such as trypsin. Cleavage products can then be added to theanchored GST-NS1-BP fusion protein and allowed to bind. After washingaway unbound peptides, labeled bound material, representing the NS1binding domain, can be eluted, purified, and analyzed for amino acidsequence by methods described in section 5.2.1., supra. Peptides soidentified can be produced synthetically or fused to appropriatefacilitative proteins using recombinant DNA technology, as described insection 5.2.1., supra.

In accordance with the invention, a given compound found to inhibit onevirus may be tested for general antiviral activity against a wide rangeof different viruses that have analogous dependencies on host cellproteins. For example, and not by way of limitation, a compound thatinhibits the interaction of influenza virus NS1 with NS1-BP by bindingto the NS1 binding site can be tested, according to the assays describedin section 5.3. infra, against other viruses, particularly those thathave similar proteins, e.g., parainfluenza viruses.

5.3. ASSAYS FOR ANTIVIRAL ACTIVITY

Any of the inhibitory compounds that are identified in the foregoingassay systems may be tested for antiviral activity.

5.3.1. VIRAL GROWTH ASSAYS

The ability of an inhibitor identified in the foregoing assay systems toprevent viral growth can be assayed by plaque formation or by otherindices of viral growth, such as the TCID₅₀ or growth in the allantoisof the chick embryo. In these assays, an appropriate cell line orembryonated eggs are infected with wild-type influenza virus, and thetest compound is added to the tissue culture medium either at or afterthe time of infection. The effect of the test compound is scored byquantitation of viral particle formation as indicated by hemagglutinin(HA) titers measured in the supernatants of infected cells or in theallantoic fluids of infected embryonated eggs; by the presence of viralplaques; or, in cases where a plaque phenotype is not present, by anindex such as the TCID₅₀ or growth in the allantois of the chick embryo,or with a hemagglutination assay.

An inhibitor can be scored by the ability of a test compound to depressthe HA titer or plaque formation, or to reduce the cytopathic effect invirus-infected cells or the allantois of the chick embryo, or by itsability to reduce viral particle formation as measured in ahemagglutination assay.

An inhibitor can also be scored by the ability of a test compound todecrease the amount of a viral protein, e.g., rhabdovirus M protein,released from cells in a functional budding assay, such as the onedescribed in Justice, et al., 1995 (J. Virol., 69, 3156-3160) and usedin the experiments described in the example in section 9.1.5 infra).

5.3.2. ANIMAL MODEL ASSAYS

The ability of an inhibitor to prevent replication of viruses such aspicornaviruses, caliciviruses, astroviruses, togaviruses, flaviviruses,coronaviruses, arteriviruses, retroviruses, paramyxoviruses (includingparainfluenza viruses, measles virus, mumps virus, and respiratorysyncytial virus), rhabdoviruses, filoviruses, orthomyxoviruses,orthomyxo-like viruses including Dhori insect virus, bunyaviruses, andarenaviruses; and double-stranded, positive-sense RNA viruses such asreoviruses and bimaviruses, as well as human immunodeficiency virus(HIV), members of the herpes virus family, and adenoviruses, can beassayed in animal models that are natural or adapted hosts or transgenichosts for the virus. Such animals may include mammals such as pigs,ferrets, mice, monkeys, horses, and primates, or birds. As described indetail in Section 5.5 infra, such animal models can be used to determinethe LD₅₀ and the ED₅₀ in animal subjects, and such data can be used toderive the therapeutic index for the inhibitor of the viral/host cellprotein interaction.

5.4. INHIBITORY COMPOUNDS

Inhibitory compounds identified in the foregoing screening assays whichmay be used in accordance with the invention may include but are notlimited to, polypeptides, peptides, peptidomimetics, antibodies, nucleicacids (e.g, RNA and DNA), and small organic or inorganic molecules andother drugs.

Preferably, the inhibitory compounds have a molecular weight of lessthan 2000 Daltons, more preferably less than 1500 Daltons, even morepreferably less than 1000 Daltons, and most preferably, less than 700Daltons.

Peptides having an amino acid sequence corresponding to the domain ofthe host cell protein that binds to the viral protein may be used tocompete with the native viral protein and, therefore, may be useful asinhibitors in accordance with the invention. Similarly, peptides havingan amino acid sequence corresponding to the domain of the viral proteinthat binds to the host cell protein may be used. Such peptides may besynthesized chemically or produced via recombinant DNA technology usingmethods well known in the art (e.g., Creighton, 1983, Proteins:Structures and Molecular Principles, W. H. Freeman & Co., N.Y., pp.34-49; Sambrook, et al., 1989, Molecular Cloning: A Laboratory Manual,Cold Spring Harbor Laboratories Press, Cold Spring Harbor, N.Y.). Ifdesired, nonclassical amino acids or chemical amino acid analogs can beintroduced as a substitution or addition into such a peptide.Non-classical amino acids include but are not limited to the D-isomersof the common amino acids, α-amino isobutyric acid, 4-aminobutyric acid,hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine,t-butylalanine, phenylglycine, cyclohexylalanine, β-alanine, designeramino acids such as β-methyl amino acids, Cα-methyl amino acids, andNα-methyl amino acids. Further, peptides can be chemically modified byany chemical modification techniques known to those of skill in the art,including but not limited to specific chemical cleavage by cyanogenbromide, trypsin, chymotrypsin, papain, V8 protease, NaBH₄; acetylation,formylation, oxidation, reduction; metabolic synthesis in the presenceof tunicamycin; etc. The peptides may be isolated and purified bystandard methods including chromatography (e.g. ion exchange, affinity,and sizing column chromatography), centrifugation, differentialsolubility, or by any other standard technique for the purification ofpeptides. Lipofectin or liposomes may be used to deliver the peptides tocells.

Alternatively, antibodies that are both specific for the binding domainsof either the host cell or viral proteins and interfere with theirinteraction may be used. Such antibodies may be generated using standardtechniques described in Section 5.2.1., supra, against the proteinsthemselves or against peptides corresponding to the binding domains ofthe proteins. Such antibodies include but are not limited to polyclonal,monoclonal, Fab fragments, single chain antibodies, chimeric antibodies,etc. Where whole antibodies are used, internalizing antibodies arepreferred. However, lipofectin may be used to deliver the antibody or afragment of the Fab region which binds to the viral or host cell proteinepitope into cells. Where fragments of the antibody are used, thesmallest inhibitory fragment that binds to the target protein's bindingdomain is preferred.

5.5. PHARMACEUTICAL PREPARATIONS AND METHODS OF ADMINISTRATION

The identified compounds that inhibit viral replication can beadministered to an animal, preferably a mammal and most preferably ahuman at therapeutically effective doses to treat, prevent or inhibitviral infection. A therapeutically effective dose is the amount of acompound sufficient to inhibit or reduce viral replication, the amountcompound sufficient to induce or enhance an immune response to thevirus, or the amount of a compound sufficient to result in ameliorationof symptoms of viral infection. In a preferred aspect, the compoundadministered to treat, prevent or inhibit viral infection issubstantially purified (e.g., substantially free from substances thatlimit its effect or produce undesired side-effects).

Toxicity and therapeutic efficacy of such compounds can be determined bystandard pharmaceutical procedures in cell cultures or experimentalanimals, e.g., for determining the LD₅₀ (the dose lethal to 50 of thepopulation) and the ED₅₀ (the dose therapeutically effective in 50 ofthe population). The dose ratio between toxic and therapeutic effects isthe therapeutic index and it can be expressed as the ratio LD₅₀/ED₅₀.Compounds that exhibit large therapeutic indices are preferred. Whilecompounds that exhibit toxic side effects may be used, care should betaken to design a delivery system that targets such compounds to thesite of infection in order to minimize damage to uninfected cells andreduce side effects.

The data obtained from the cell culture assays and animal studies can beused in formulating a range of dosage for use in humans. The dosage ofsuch compounds lies preferably within a range of circulatingconcentrations that include the ED50 with little or no toxicity. Thedosage may vary within this range depending upon the dosage formemployed and the route of administration utilized. For any compound usedin the method of the invention, the therapeutically effective dose canbe estimated initially from cell culture assays. A dose may beformulated in animal models to achieve a circulating plasmaconcentration range that includes the IC50 (i.e., the concentration ofthe test compound that achieves a half-maximal infection, or ahalf-maximal inhibition) as determined in cell culture. Such informationcan be used to more accurately determine usefuldoses in humans. Levelsin plasma may be measured, for example, by high performance liquidchromatography. Suitable doses for antibodies range from 0.1 mg/kg to100 mg/kg of body weight (generally 10 mg/kg to 20 mg/kg), and suitabledoses for peptides or polypeptide range from about 0.001 to 30 mg/kgbody weight.

Various delivery systems are known and can be used to administer acompound, e.g., encapsulation in liposomes, microparticles,microcapsules, recombinant cells capable of expressing the compound,receptor-mediated endocytosis (see, e.g., Wu and Wu, 1987, J. Biol.Chem. 262:4429-4432), construction of a nucleic acid as part of aretroviral or other vector, etc. Methods of introduction include but arenot limited to intratumoral, intradermal, intramuscular,intraperitoneal, intravenous, subcutaneous, intranasal, epidural, andoral routes. The compounds may be administered by any convenient route,for example by infusion or bolus injection, by absorption throughepithelial or mucocutaneous linings (e.g., oral mucosa, rectal andintestinal mucosa, etc.) and may be administered together with otherbiologically active agents. Administration can be systemic or local. Inaddition, it may be desirable to introduce the pharmaceuticalcompositions of the invention into the central nervous system by anysuitable route, including intraventricular and intrathecal injection;intraventricular injection may be facilitated by an intraventricularcatheter, for example, attached to a reservoir, such as an Ommayareservoir.

For administration by inhalation, the compounds for use according to thepresent invention are conveniently delivered in the form of an aerosolspray presentation from pressurized packs or a nebuliser, with the useof a suitable propellant, e.g., dichlorodifluoromethane,trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide orother suitable gas. In the case of a pressurized aerosol the dosage unitmay be determined by providing a valve to deliver a metered amount.Capsules and cartridges of e.g gelatin for use in an inhaler orinsufflator may be formulated containing a powder mix of the compoundand a suitable powder base such as lactose or starch.

In a specific embodiment, the compound can be delivered in a vesicle, inparticular a liposome (see Langer, Science 249:1527-1533 (1990); Treatet al., in Liposomes in the Therapy of Infectious Disease and Cancer,Lopez-Berestein and Fidler (eds.), Liss, N.Y., pp. 353-365 (1989);Lopez-Berestein, ibid., pp. 317-327; see generally ibid.)

In a specific embodiment, the compound can be delivered in a controlledrelease system. In one embodiment, a pump may be used (see Langer,supra; Sefton, CRC Crit. Ref Biomed. Eng. 14:201 (1987); Buchwald etal., Surgery 88:507 (1980); Saudek et al., N. Engl. J. Med. 321:574(1989)). In another embodiment, polymeric materials can be used (seeMedical Applications of Controlled Release, Langer and Wise (eds.), CRCPres., Boca Raton, Fla. (1974); Controlled Drug Bioa vailability, DrugProduct Design and Performance, Smolen and Ball (eds.), Wiley, New York(1984); Ranger and Peppas, J Macromol. Sci. Rev. Macromol. Chem. 23:61(1983); see also Levy et al., Science 228:190 (1985); During et al.,Ann. Neurol. 25:351 (1989); Howard et al., J. Neurosurg. 71:105 (1989)).In yet another embodiment, a controlled release system can be placed inproximity of the therapeutic target, i.e., the brain, thus requiringonly a fraction of the systemic dose (see, e.g., Goodson, in MedicalApplications of Controlled Release, supra, vol. 2, pp. 115-138 (1984)).

Other controlled release systems are discussed in the review by Langer(Science 249:1527-1533 (1990)).

In a specific embodiment where the compound is a nucleic acid encoding apolypeptide or peptide, the nucleic acid can be administered in vivo topromote expression of its encoded peptide or polypeptide, byconstructing it as part of an appropriate nucleic acid expression vectorand administering it so that it becomes intracellular, e.g., by use of aretroviral vector (see U.S. Pat. No. 4,980,286), or by direct injection,or by use of microparticle bombardment (e.g., a gene gun; Biolistic,Dupont), or coating with lipids or cell-surface receptors ortransfecting agents, or by administering it in linkage to ahomeobox-like peptide which is known to enter the nucleus (see e.g.,Joliot et al., 1991, Proc. Natl. Acad. Sci. USA 88:1864-1868), etc.Alternatively, a nucleic acid can be introduced intracellularly andincorporated within host cell DNA for expression, by homologousrecombination.

The present invention also provides pharmaceutical compositions. Suchcompositions comprise a therapeutically effective amount of a compound,and a pharmaceutically acceptable carrier. In a specific embodiment, theterm “pharmaceutically acceptable” means approved by a regulatory agencyof the Federal or a state government or listed in the U.S. Pharmacopeiaor other generally recognized pharmacopeia for use in animals, and moreparticularly in humans. The term “carrier” refers to a diluent,adjuvant, excipient, or vehicle with which the therapeutic isadministered. Such pharmaceutical carriers can be sterile liquids, suchas water and oils, including those of petroleum, animal, vegetable orsynthetic origin, such as peanut oil, soybean oil, mineral oil, sesameoil and the like. Water is a preferred carrier when the pharmaceuticalcomposition is administered intravenously. Saline solutions and aqueousdextrose and glycerol solutions can also be employed as liquid carriers,particularly for injectable solutions. Suitable pharmaceuticalexcipients include starch, glucose, lactose, sucrose, gelatin, malt,rice, flour, chalk, silica gel, sodium stearate, glycerol monostearate,talc, sodium chloride, dried skim milk, glycerol, propylene, glycol,water, ethanol and the like. The composition, if desired, can alsocontain minor amounts of wetting or emulsifying agents, or pH bufferingagents. These compositions can take the form of solutions, suspensions,emulsion, tablets, pills, capsules, powders, sustained-releaseformulations and the like. The composition can be formulated as asuppository, with traditional binders and carriers such astriglycerides. Examples of suitable pharmaceutical carriers aredescribed in “Remington's Pharmaceutical Sciences” by E. W. Martin. Suchcompositions will contain a therapeutically effective amount of thecompound, preferably in purified form, together with a suitable amountof carrier so as to provide the form for proper administration to thepatient. The formulation should suit the mode of administration.

In a preferred embodiment, the composition is formulated in accordancewith routine procedures as a pharmaceutical composition adapted forintravenous administration to human beings. Typically, compositions forintravenous administration are solutions in sterile isotonic aqueousbuffer. Where necessary, the composition may also include a solubilizingagent and a local anesthetic such as lignocaine to ease pain at the siteof the injection. Generally, the ingredients are supplied eitherseparately or mixed together in unit dosage form, for example, as a drylyophilized powder or water free concentrate in a hermetically sealedcontainer such as an ampoule or sachette indicating the quantity ofactive agent. Where the composition is to be administered by infusion,it can be dispensed with an infusion bottle containing sterilepharmaceutical grade water or saline. Where the composition isadministered by injection, an ampoule of sterile water for injection orsaline can be provided so that the ingredients may be mixed prior toadministration.

For oral administration, the pharmaceutical compositions may take theform of, for example, tablets or capsules prepared by conventional meanswith pharmaceutically acceptable excipients such as binding agents(e.g., pregelatinised maize starch, polyvinylpyrrolidone orhydroxypropyl methylcellulose); fillers (e.g., lactose, microcrystallinecellulose or calcium hydrogen phosphate); lubricants (e.g., magnesiumstearate, talc or silica); disintegrants (e.g., potato starch or sodiumstarch glycollate); or wetting agents (e.g., sodium lauryl sulphate).The tablets may be coated by methods well known in the art. Liquidpreparations for oral administration may take the form of, for example,solutions, syrups or suspensions, or they may be presented as a dryproduct for constitution with water or other suitable vehicle beforeuse. Such liquid preparations may be prepared by conventional means withpharmaceutically acceptable additives such as suspending agents (e.g.,sorbitol syrup, cellulose derivatives or hydrogenated edible fats);emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles(e.g.,almond oil, oily esters, ethyl alcohol or fractionated vegetableoils); and preservatives (e.g., methyl or propyl-p-hydroxybenzoates orsorbic acid). The preparations may also contain buffer salts, flavoring,coloring and sweetening agents as appropriate.

The compounds can be formulated as neutral or salt forms.Pharmaceutically acceptable salts include those formed with free aminogroups such as those derived from hydrochloric, phosphoric, acetic,oxalic, tartaric acids, etc., and those formed with free carboxyl groupssuch as those derived from sodium, potassium, ammonium, calcium, ferrichydroxides, isopropylamine, triethylamine, 2-ethylamino ethanol,histidine, procaine, etc.

Suppositories generally contain active ingredient in the range of 0.5%to 10% by weight; oral formulations preferably contain 10% to 95% activeingredient.

The invention also provides a pharmaceutical pack or kit comprising oneor more containers filled with one or more of the ingredients ofthepharmaceutical compositions of the invention. Optionally associated withsuch container(s) can be a notice in the form prescribed by agovernmental agency regulating the manufacture, use or sale ofpharmaceuticals or biological products, which notice reflects approvalby the agency of manufacture, use or sale for human administration.

5.5.1. INTRACELLULAR DELIVERY OF PHARMACEUTICAL PREPARATIONS

The identified compounds that inhibit viral replication may beadministered intracellularly to a patient at therapeutically effectivedoses to treat viral infection. In one embodiment, an oligopeptide ofthe invention is targeted directly to the cytoplasm and/or nucleus of acell. In a specific embodiment, the targeting system employed is apenetratin or “trojan peptide” targeting system (Derossi, et al., 1998,Trends Cell Biol. 8:84-87).

The trojan peptide targeting system employs a fusion protein createdusing standard methods of recombinant polypeptide technology. The fusionprotein comprises an oligopeptide sequence to be internalized fused to atrojan peptide internalization vector, e.g., penetratin-1 orAntennapedia homeodomain (Antp-HD). The oligopeptide “cargo” and thetrojan peptide vector may be coupled by several methods including, butnot limited to, 1) chemical synthesis and coupling in which the cargoand the trojan vector are synthesized separately and attached through adisulfide bond; 2) chemical synthesis in tandem in which a peptidiccargo is synthesized in tandem with a trojan vector; and 3) an in vitrorecombination approach in which fusion peptides comprising the thirdhelix of Antp-HD, a tag (e.g., myc) and a cargo are expressed in E. coliand purified (Derossi, et al., 1998, Trends Cell Biol. 8:84-87).

The trojan peptide targeting system may be used to targetinternalization of fusion proteins of at least 100 amino acids, andpossibly longer, depending on the structure of the cargo. Preferably,the fusion proteins are preincubated with small amounts of SDS orpolysialic acid to mask positive charges and prevent peptidedegradation. Such treatment blocks instantaneous internalization, andallows slow diffusion and progressive release when the fusion protein isadministered in the peritoneal cavity (Derossi, et al., 1998, TrendsCell Biol. 8:84-87).

6. EXAMPLE: THE IDENTIFICATION OF NPI-1 AND ITS INTERACTION WITHINFLUENZA NUCLEOPROTEIN

The yeast interactive trap system was used to identify a cellularprotein which interacts with the nucleoprotein of influenza A viruses.This protein, nucleoprotein interactor 1 (NPI-1) is the human homologueof the yeast protein SRP1. SRP1 was previously identified as asuppressor of temperature-sensitive RNA polymerase I mutations (Yano,et. al., 1992, Mol. Cell. Biol. 12:5640-5651). A full length cDNA cloneof NPI-1 was generated from HeLa cell poly A+ RNA. The viral NP, whichhad been partially purified from influenza A/PR/8/34 virus-infectedembryonated eggs, could be coprecipitated from solution by glutathioneagarose beads complexed with a bacterially expressedglutathione-S-transferase (GST)-NPI-1 fusion protein, confirming theresults of the yeast genetic system. Antisera raised against NPI-1identified a 65 kDa polypeptide from total cellular extracts of bothHeLa and MDBK cells. In addition, the viral nucleoprotein wasco-immunoprecipitated from influenza A/WSN/33 virus-infected HeLa cellsby antisera directed against NPI-1, demonstrating an interaction ofthese two proteins in infected cells, and suggesting that NPI-1 plays arole during influenza virus replication.

6.1. MATERIALS AND METHODS

6.1.1. YEAST, BACTERIA AND PLASMIDS

Yeast strain EGY48 (Mata trpl ura3 his3 LEU2::pLEXAop6-LEU2) (Zervos, etal., 1993, Cell 72: 222-232) and plasmids pEG202, pSH18-34, and pRFHM1and the HeLa cell cDNA library constructed in pJG4-5 (Gyuris, et al.,1993, Cell 75: 791-803) were previously described. Similar versions ofthese plasmids and this yeast host strain are available commerciallyfrom Clontech as part of a two fusion protein system. pLexA-NP wasconstructed by subcloning the cDNA of influenza-A/PR/8/34 NP as a LexAtranslational fusion gene into pEG202 (FIG. 1). Yeast strainsconstructed as part of these studies are described in Table 2.Escherichia coli MH3 (trpC araD lacXhsdR galU galK) and W31005 werepreviously described (Hall, et al., 1984, Cell 36: 1057-1065).

6.1.2. SELECTION OF NP INTERACTORS

An interactive trap selection was performed essentially as has beenpreviously described (Gyuris, et al., 1993, Cell 75: 791-803; Zervos, etal., 1993, Cell 72: 222-232). Strain R100 was transformed by the HeLacDNA library using the lithium acetate method (Ito, et al., 1983, J.Bacteriol. 153: 163-168). 2×10⁶ primary yeast transformants wereselected on twelve 25×25 cm² his⁻trp⁻ glucose plates, pooled and storedat −70° C. Library transformants were selected for leu+ phenotype onhis⁻leu⁻-galactose plates; the efficiency of plating was approximately10⁻⁴ leu+ colonies per galactose+ colony. Plasmid DNA was isolated fromleu+ library transformants as described by Hoffman and Winston (Hoffman& Winston, 1987, Gene 57: 267-272) and introduced into MH3 cells byelectroporation. Library plasmids were selected by plating thetransformation mix on 1×A+amp+glucose plates (Miller, 1972, Experimentsin Molecular Genetics, Cold Spring Harbor Laboratory Press, Cold SpringHarbor, N.Y.).

cDNAs were analyzed by checking the specificity of interaction with theNP. Each isolated plasmid was introduced into strains RI101 and R 102.These strains harbor pSH18-34, a reporter plasmid encodingβ-galactosidase with a GAL1 promoter transcriptionally controlled fromupstream LexA binding sites. Strain R102 was used as a negative controlfor NP-specificity of cloned cDNAs. It contains pRFHM1, which encodesLexA fused to a transcriptionally inert fragment of the Drosophilamelanogaster bicoid protein. β-Galactosidase activity was assayed onnitrocellulose replicas of the colonies by freeze fracturing the cellsand incubating in buffer containing5-bromo-4-chloro-3-indolyl-β-D-galactoside (X-gal) (Miller, 1972,Experiments in Molecular Genetics, Cold Spring Harbor Laboratory Press,Cold Spring Harbor, N.Y.). Plasmids which conferred both a leu+ andβ-gal+ phenotypes in the presence of pLexA-NP but not in the presence ofpRFHM1 were saved for further study.

6.1.3. CLONING OF THE 5′ TERMINUS OF NPI-1

The 5′ terminus of NPI-1 was cloned by rapid amplification of cDNA ends(“RACE”) by the method of Frohman (Frohman, 1990, in PCR Protocols: AGuide to Methods and Applications, Innis, et. al., eds., Academic PressInc., San Diego, p. 28-38; Frohman, et al., 1988, Proc. Natl. Acad. Sci.USA 85: 8998-9002). Reverse transcription of 1 μg of poly A+ HeLa cellRNA was performed using the NPI-1 specific oligonucleotide5′GCAAAGCAGGAGAAACCAC3′ (SEQ ID NO:24). First strand cDNA was tailedwith dCTP by terminal transferase. PCR amplification of the reversetranscription product was performed with the nested NPI-1 primer5′GGGTCCATCTGATAGATATGAGAG3′ (SEQ ID NO:25) and the 5′ RACE anchorprimer 5′ CUACUACUACUAGGCCACGCGTCGACTACTACGGGIIGGGIIGGGIIG3′ (SEQ IDNO:26; Gibco/BRL). The PCR product was subcloned into pGEM-T (Promega)and was sequenced by standard protocols. 5′RACE products from threeindependent experiments were cloned and sequenced in order to avoiderrors introduced by PCR.

6.1.4. BACTERIAL EXPRESSION AND PURIFICATION OF GST-NPI-1

The NPI-1 cDNA derived from a HeLa cDNA library was subcloned into theEcoRI and XhoI restriction endonuclease sites of theglutathione-S-transferase fusion vector pGEX-5X-1 (Pharmacia) togenerate the plasmid pGST-NPI-1. Protein was induced from bacterialexpression plasmids in W31005 cells with isopropyl-β-D-galactopyranosideaccording to standard protocols (Smith & Johnson, 1988, Gene 67: 31-40).Bacteria were pelleted 4 h after induction, washed in ice cold phosphatebuffered saline (PBS), and resuspended in one-tenth culture volume PBS+1Triton X-100. Bacteria were lysed on ice with four 15 s pulses in aRaytheon sonicator at an output setting of 1 amp. Insoluble material waspelleted at 50,000×g for 30 min in a Beckman TL-100.3 rotor.

GST-NPI-1 and GST were purified from bacterial lysates onglutathione-agarose beads (Sigma Chemical Corporation.). Beads wereswelled according to the manufacturer's instructions and equilibrated inPBS. Typical binding reactions were done in 500 μl of PBS/0.1 TritonX-100, and included 50 μl bacterial lysate and 10 μl of a 50 slurry ofglutathione-agarose beads. Binding reactions were incubated for 5 min atroom temperature on a rotating wheel. Beads were collected bycentrifugation for 5 s in a microfuge, and were washed three times inPBS.

6.1.5. NP BINDING ASSAY

To assay binding of NP to GST-NPI-1/bead complexes typical reactionswere performed in 500 μl of ice cold PBS+0.05 Nonidet P-40 and containedwashed GST-NP1-1/bead complexes and 10 μg partially purified influenzavirus polymerase and nucleoprotein preparations (Pol/NP). Virus wasprepared from embryonated eggs infected by influenza A/PR/8/34 virus andPOL/NP preparations were purified as previously described (Enami, etal., 1990, Proc. Natl. Acad. Sci. USA 87: 3802-3805; Parvin, et al.,1989, J. Virol. 63: 5142-5152). NP was bound for 1 h at 4° C. on arotating wheel. Beads werecollected by centrifugation for 5 s in amicrofuge, and were washed three times in PBS+0.05 NP-40. Washed beadswere resuspended in 50 μl SDS sample buffer (Sambrook, et al., 1989,Molecular Cloning: A Laboratory Manual, Cold Spring Harbor LaboratoriesPress, Cold Spring Harbor, N.Y.), boiled for 5 min, and pelleted in amicrofuge. 10 μl of each supernatant was separated by electrophoresis ona 12.5 SDS-polyacrylamide gel. Gels were either stained with Coomassieblue or processed for immunoblot analysis. NP was detected byimmunoblotting with the monoclonal antibody HT103.

6.1.6. ANTISERA AND IMMUNOBLOTTING

Polyclonal rabbit antisera against NPI-1 was generated by immunizationof a female NZY Rabbit (Buckshire Farms) with 200 μg of purifiedGST-NPI-1 in complete Freund's adjuvant, followed by two boosts of 100μg in incomplete Freund's adjuvant at three week intervals. Thespecificity of antisera was demonstrated by immunoblot analysis ofGST-NPI-1 in bacterial lysates. Immunoblots were performed by standardmethods (Harlow and Lane, 1988, Antibodies: A Laboratory Manual, ColdSpring Harbor Laboratories Press, Cold Spring Harbor, N.Y.). Sera wereused at a dilution of 1:1000.

6.1.7. VIRUSES AND CELLS

Total cell lysates from HeLa and MDBK cells were generated by directlysing of cells in SDS-sample buffer, followed by shearing ofchromosomal DNA by passage through a 21 ga. syringe. Cytoplasmicextracts were generated by lysing cells in ice cold NP-40 lysis buffer(10 mM Tris-Cl, pH 8.0; 100 mM NaCl; 1 mM EDTA; 1 mM DTT; 1 NonidetP-40; 1 mM 4-(2-aminoethyl)benzenesulfonylfluoride-hydrochloride(Pefabloc)). After 10 min on ice nuclei were removed by centrifugation.Proteins were separated by SDS-PAGE, transferred to nitrocellulose andvisualized by immunoblotting.

To generate infected cell lysates containing metabolically labeledproteins 4×10⁶ HeLa cells were infected with influenza A/WSN/33 virus ata multiplicity of 10 for 45 min at 37° C. Infection was allowed toproceed in DMEM+0.1 BSA for 5 h at which time cells were labeled with 50μCi ³⁵S-methionine+50 μCi ³⁵S-cystine in MEM-cys-met for 1 h. Extractswere prepared by resuspending infected cells in 650 μl ice cold NP-40lysis buffer followed by two 15 s pulses in a Raytheon sonicator todisrupt nuclei. Insoluble cell debris was removed by centrifugation at100,000×g in a TL-100.3 Beckman rotor. 5 μl anti-NPI-1 sera wasincubated on ice for 1 h with 100 μl infected cell lysates. Immunecomplexes were precipitated from solution by incubation withSepharose-4B linked protein G beads (Sigma Chemical Co.) for 1 h. Beadswere collected by centrifugation, washed three times in NP-40 lysisbuffer, and resuspended in SDS-sample buffer. Precipitated proteins wereseparated by SDS-PAGE and visualized by autoradiography.

6.2. RESULTS

6.2.1. ISOLATION OF NPI-1

The interactive trap was used to identify proteins which specificallyinteract with the influenza A virus nucleoprotein (NP). The interactivetrap is one of several genetic systems recently developed which uses themodular nature of transcription activators to detect protein:proteininteractions (Chien, et al., 1991, Proc. Natl. Acad. Sci. USA 88:9578-9582; Dalton & Treisman, 1992, Cell 68: 597-612; Durfee, et al.,1993, Genes Dev. 7: 555-569; Gyuris, et al., 1993, Cell 75: 791-803;Vojtek, et al., 1993, Cell 74: 205-214; Zervos, et al., 1993, Cell 72:222-232). The interactive trap consists of three components: (1) areporter gene that has no basal transcription; (2) a fusion proteinwhich contains a LexA DNA binding domain that is transcriptionallyinert; and (3) proteins encoded by an expression library, which areexpressed as fusion proteins containing an activation domain (FIG. 1A).Interaction of the LexA fusion protein and the fusion protein containingthe activation domain will constitute a bimolecular transcriptionalactivator which, in this case, will confer the ability to grow on medialacking leucine (Gyuris, et al., 1993, Cell 75: 791-803; Zervos, et al.,1993, Cell 72: 222-232). In the absence of this interaction the leu2gene is not transcribed.

The NP gene of influenza A/PR/8/34 virus was subcloned as atranslational fusion gene with the LexA gene into pEG202 to generatepLexA-NP (FIG. 1B). Strain R100 (Table II), which contains pLexA-NP, wastransformed with a HeLa cell cDNA library constructed in pJG4-5. pJG4-5contains an activation domain under control of a GAL1 promoter (Gyuris,et al., 1993, Cell 75: 791-803).

TABLE II YEAST STRAINS USED Strains Genotype EGY48 Mata trp1.ura3.his3LEU2::pLEXAop6-LEU2 R100 EGY48, pLexA-NP(TRP1) R101 EGY48, pLexA-NP,pSH18-34(HIS3) R102 EGY48, pRFHM1(TRP1), pSH18-34

Library plasmids were rescued from 100 leu+ colonies. Reproducibility ofthe interaction of the NP with the encoded library proteins was testedby transforming library plasmids into strain R101. Transformants werescreened for galactose-dependent β-galactosidase activity and growth onmedia lacking leucine. Specificity for NP was analyzed by checking theability of library plasmids to confer growth on leu⁻ media andβ-galactosidase activity in connection with a different LexA fusionplasmid, pRFHM1, encoding a fragment of the Drosophila melanogasterbicoid protein. Twenty-three library plasmids were confirmed to encodeNP-interactive proteins. Twelve identical 2.1 kbp clones encoded thecarboxy terminal fragment of a protein termed nucleoprotein interactor 1(NPI-1). Partial DNA sequencing showed that NPI-1 is the human homologueof the yeast SRP1 gene (infra).

6.2.2. CLONING AND SEQUENCING OF THE NPI-1 cDNA

The 2.1 kbp NPI-1 cDNA in pJG4-5 was sequenced by standard protocols.The 5′cDNA terminus of the NPI-1 gene was cloned by 5′RACE. cDNAs from 3independently derived NPI-1 5′RACE products were cloned and sequenced.Nucleotide and derived amino acid sequences of NPI-1 are shown in FIG.2. The sequence reveals a 2.9 kbp cDNA which encodes a protein of 527amino acids with a calculated molecular weight of 58,754 Da and apI=4.74. The carboxyl terminal 265 amino acids were encoded by theinteractive trap library plasmid and interact with the viral NP.

Comparison of the deduced amino acid sequences in the GenBank and EMBLdatabases using the FASTA and TFASTA programs (Deveraux, et al., 1984,Nucleic Acids Res. 12:387-395) demonstrated that NPI-1 is the humanhomologue of the Saccharomyces cerevisiae protein SRP1 (Yano, et al.,1992, Mol. and Cell. Biol. 12: 5640-5651). SRP1 was cloned as anallele-specific suppressor of ts mutations in the zinc-binding domain ofthe A190 subunit of RNA polymerase I. The amino. acid sequence is highlyconserved between NPI-1 and SRP1: 50% identity and 81% similarity at theamino acid level. The amino terminus of NPI-1 has a potential nuclearlocalization signal (Chelsky, et al., 1989, Mol. Cell. Biol.9:2487-2492); amino acids 25 to 49 are rich in arginine, and contain astretch of four consecutive arginines at amino acids 28 to 31. NPI-1,like SRP 1, contains a series of 8 consecutive ARM motifs, which are 42amino acid protein subsequences originally identified in the Drosophilaarmadillo protein (Peifer, et al., Cell 76: 789-791, 1994; Yano, et al.,1992, Mol. and Cell. Biol. 12: 5640-5651) (FIGS. 3A-3B).

6.2.3. NPI-1 BINDS TO NP IN VITRO

In order to demonstrate that the NPI-1 binds to the viral NP, the NPI-1cDNA fragment (amino acids 262 to 527) was subcloned into the bacterialexpression vector pGEX-5X-1 yielding a glutathione S-transferase fusiongene. The expressed fusion protein was purified from bacterial lysateson glutathione agarose beads. NP, which had been partially purified withthe viral polymerase from influenza A/PR/8/34 virus was specificallyprecipitated from solution by glutathione agarose beads complexed withGST-NPI-1 (FIG. 4). The NP band migrates slightly faster than that ofthe GST-NPI-1 fusion protein. The identity of this protein was confirmedby immunoblot analysis using the anti-NP monoclonal antibody HT103 (FIG.4, lane 8).

6.2.4. IMMUNODETECTION OF NPI-1 IN CELL EXTRACTS

Rabbit antisera raised against GST-NPI-1 were used to identify apolypeptide from total cellular extracts of both HeLa and MDBK cellswith an apparent molecular weight of 65 kDa (FIG. 5). The molecularweight predicted from the derived amino acid sequence of the CDNA isslightly smaller (59 kDa). A lower amount of NPI-1 was present in thecytoplasmic fraction-generated by lysis of cells in the presence ofNP-40 than in the total cellular extract suggesting that most of NPI-1is located in the nucleus (FIG. 5). This is consistent with resultslocalizing the NPI-1 homologue SRPI to the nucleus of yeast cells byimmunofluorescence (Yano, et al., 1992, Mol. and Cell. Biol. 12:5640-5651). Localization of NPI-1 to a particular intracellularcompartment by immunofluorescence experiments has not been possible dueto the high background fluorescence of the antisera preparations used.

6.2.5. NPI-1 INTERACTS WITH NP IN INFECTED CELLS

Since NP formed a complex with NPI-1 in vitro, whether NP and NPI-1 forma complex in infected cells was examined. NP was specificallycoimmunoprecipitated from extracts of influenza A/WSN virus infectedHeLa cells by antisera directed against NPI-1 (FIG. 6). Thisdemonstrates an interaction of the viral NP and the cellular NPI-1during influenza A virus infection.

7. EXAMPLE THE IDENTIFICATION OF NS1I-1 AND ITS INTERACTION WITHINFLUENZA NS1 PROTEIN

In the example described below, the yeast interactive trap system wasused to identify a human protein, NS1I-1 (NS1-interactor-l), from a HeLacell cDNA library on the basis of its binding to NS1 of influenza Avirus. NS1I-1 is shown herein to be recognized not only by NS1 proteinsfrom five human and avian influenza A strains, but also by NS1 ofinfluenza B virus. Surprisingly, NS1I-1 is homologous to a steroiddehydrogenase isolated from pigs (Leenders, et al., 1994, Eur. J.Biochem. 222: 221-227). Several proteins with a dehydrogenase functionhave recently been shown not only to have enzymatic activity but also tobe involved in post-transcriptional events of gene-expression (Hentze,1994, Trends Biochem. Sci. 19: 101-103). This strong conservationsupports an important functional role of the NS1I-1 interaction duringthe viral life cycle.

7.1. MATERIALS AND METHODS

7.1.1. YEAST E. COLI STRAINS, AND PLASMIDS

Manipulations of nucleic acids, Escherichia coli and yeast followedessentially standard procedures as described elsewhere (Ausubel, et al.,1992, Current Protocols in Molecular Biology, Green PublishingAssociates, Inc., and John Wiley & sons, Inc., New York). The yeaststrains EGY40 (Mata irpl ura3 his3) and EGY48 (Mata trpl ura3 his3LEU2::pLEX-Aop6-LEU2) as well as plasmids pEG202, pRFHM1, and pSH18-34,and the HeLa cell cDNA constructed in pJG4-5 have been described(Gyuris, et al., 1993, Cell 75: 791-803; Zervos, et al., 1993, Cell 72:222-232). E. coli strains used for cloning and expression were MH3 (trpCaraD lacX hsdR galU galK), DH5α (F⁻Φ80dlacZΔM 15 Δ(lacZY−argF)U169 deoRrecA1 endA1 hsdR17(r_(K)−m_(K)+) supE44λ-thi- gyrA96 relA1), and BL26(F⁻ompT hsdS_(B)(r_(B) ⁻m_(B) ⁻) gal dcm). pLexA-NS1 was constructed bysubcloning the cDNA of the NS segment of influenza virus A/PR/8/34downstream of the LexA gene in pEG202. pGEX-NS1I-1 was constructed bysubcloning the HeLa cDNA-insert of library plasmid pK5 as anEcoRI/XboI-fragment into pGEX-5X-l (Pharmacia). DNA-oligonucleotidesused were: GSP-I, 5′-dTCCTGATGTTGCTGTAGACG-3′ (SEQ ID NO:27), GSP-II,5′-dGCACGACTAGTATGATTTGC-3′ (SEQ ID NO:28), and the 5′RACE anchor primer(BRL), 5′-dCUACUACUACUAGGCCACGCGTCGACTACTACGGGIIGGGIIGGGIIG-3′ (SEQ IDNO:26).

7.1.2. IDENTIFICAFION OF NS1-INTERACTORS

The interactive trap selection was performed essentially as describedfor NPI-1 in Section 6.1.2, above. The selection strain was constructedby transforming EGY48 with the bait plasmid pLexA-NS1 and thelacZ-reporter plasmid pSH18-34. Expression of lacZ from pSH18-34 istranscriptionally controlled by a GAL1 promoter and LexA-dependentoperator sites. A HeLa cell cDNA library was introduced into theselection strain using the lithium acetate method (Ito, et al., 1983, J.Bacteriol. 153: 163-168). Primary transformants were selected ontrp⁻his⁻ura⁻ glucose plates. 1×10⁶ cells representing 3.3×10⁵independent transformants were plated on 150 mmtrp⁻his⁻ura⁻leu⁻-galactose plates to select for clones expressingNS1-interacting proteins. Viable cells were replica-transferred to anitrocellulose filter and assayed for β-galactosidase activity using5-bromo-4-chloro-3-indolyl β-D-galactoside (X-gal) as described(Ausubel, et al., 1992, Current Protocols in Molecular Biology, GreenPublishing Associates, Inc., and John Wiley & sons, Inc., New York).Positive clones were tested in a second round of selection by replicaplating onto X-gal trp⁻his⁻ura⁻ galactose plates. Plasmid DNA wasisolated from yeast clones expressing β-galactosidase activity only ongalactose plates and library plasmids were recovered by transformationinto E. coli MH3 as described in Section 6.1.2, above. The specificityof the isolated plasmids was tested by co-transformation with pLexA-NS1or pRFHM1 into EGY40 harboring pSH18-34. pRFHM1 expresses an unrelatedLexA-bicoid fusion protein. The resulting strains were assayed forβ-galactosidase activity on X-gal trp⁻his⁻ura⁻ plates containing glucoseor galactose. Plasmids that induced β-galactosidase only in the presenceof galactose and only in conjunction with pLexA-NS1 were considered toencode true interacting proteins.

7.1.3. CLONING OF NS1I-1 5′-END cDNA

Cloning of cDNA derived from the 5′-end of NS1I-1 mRNA followed aRACE-procedure (rapid amplification of cDNA ends) (Frohman, et al.,1988, Proc. Natl. Acad. Sci. USA 85: 8998-9002) using a 5′RACE-kit(BRL). First strand CDNA was synthesized from 1 μg of HeLa cellpoly(A)-RNA hybridized to 2.5 pmol NS1I-1-specific oligonucleotide GSP-1using reverse transcriptase. The cDNA was tailed at the 5′-end with dCby terminal transferase. The product was used as a template for theamplification of a 5′RACE-product by PCR using a nested oligonucleotideGSP-II and an anchor primer provided by the kit. The resulting fragmentwas subcloned in pGEM-T (Promega) to form pRACENS1I-1, and sequenced bythe standard dideoxy method. The NCBI-search was conducted using Fasta,Tfasta. Sequence comparison was conducted using Bestfit.

7.1.4. NORTHERN BLOT ANALYSIS

1 μg of HeLa cell poly(A)-RNA was separated on a 1 agarose-fornaldehydegel, transferred to a nylon membrane (Nytran, Amersham), andUV-crosslinked. The RNA was hybridized to a ³²P-labeled, NS1I-1 specificprobe derived form a fragment (corresponding to positions +791 to +1745)of the original pK5 library isolate as described (Ausubel, et al., 1992,Current Protocols in Molecular Biology, Green Publishing Associates,Inc., and John Wiley & sons, Inc., New York).

7.1.5. VIRUSES, CELLS, AND EXTRACTS

Influenza strains A/WSN/33 (H1N1), A/Berkeley/1/68 (H2N2),A/Beijing/32/92 (H3N2), A/duck/Alberta/76 (N12N5), A/turkey/Oregon/71(H7N5), and B/Lee/40 were grown in the allantoic cavity of 10 days oldembryonated chicken eggs. Confluent monolayers of Madin Darby caninekidney-(MDCK)-cells were infected with influenza viruses at an m.o.i. of10 for one hour in 35 mm dishes. Infection was continued at 37° C.(influenza A viruses) or 35° C. (influenza B/Lee/40) for 5 hours inMEM-medium containing 0.1 bovine serum albumin. Cells were labeled with100 μCi of ³⁵S-methionine and ³⁵S-cysteine (ICN) per dish for one hourin MEM-met⁻cys⁻medium. Cells were washed and scraped in ice-coldphosphate buffered saline (PBS). Cells from one dish were lysed with 500μl NET-N buffer (10 mM Tris/HCl pH 8.0, 1 mM EDTA, 150 mM NaCl, 0.05Nonidet P 40) and two 30 second pulses in a Raytheon sonicator at asetting of 1A. Lysates were centrifuged for 10 minutes at 20,000 rpm ina TL100.3 rotor. The supernatants were used for precipitation ofproteins.

7.1.6. EXPRESSION OF GST-NS1I-1 FUSION PROTEIN IN E. COLI ANDPRECIPITATION OF VIRAL PROTEINS FROM CELL EXTRACTS

NS1I-1 was expressed in E. coli BL26 from pGEX-NS1I-1 as a GST(glutathione-S-transferase)-NS1I-1 fusion protein with a predictedmolecular weight of 77 kDa. Production of GST-NS1I-1 was induced usingisopropyl-β-D-galactopyranoside essentially as described (Smith &Johnson, 1988, Gene 67: 31-40). GST-NS1I-1 was adsorbed from bacteriallysates to glutathione sepharose beads (Pharmacia) as recommended by themanufacturer. Beads-were washed three times with PBS to removecontaminating proteins. 10 μl of glutathione sepharose coated withGST-NS1I-1 fusion protein was rotated with 100 μl extract ofvirus-infected MDCK-cells (see above) in 750 μl NET-100 buffer (20 mMHepes, pH 8.0, 100 mM NaCl, 0.5 mM DTT) for 90 minutes at 4° C. Thebeads were washed three times with PBS/0.05 NP-40 and precipitatedproteins were analyzed by SDS-gel electrophoresis and autoradiography.In parallel reactions, viral proteins were immunoprecipitated from 50 μlof infected cell extracts using 5 μl of anti-NS1 or anti-M1 antiserumand protein A agarose as described (Harlow and Lane, 1988, Antibodies: ALaboratory Manual, Cold Spring Harbor Laboratories Press, Cold SpringHarbor, N.Y.). As a negative control, GST protein was expressed in BL26from pGEX-5X-1 and used the same way in the co-precipitation assay.

7.2. RESULTS

7.2.1. ISOLATION OF NS1-INTERACTING FACTORS

The yeast interaction trap system (Gyuris, et al., 1993, Cell 75:791-803; Zervos, et al., 1993, Cell 72: 222-232) was used to identifycellular proteins that interact with the non-structural protein NS1 ofinfluenza A virus. A LexA-NS1 fusion protein was used as bait to screenlibrary in which HeLa cell cDNAs were expressed as fusions with anacidic transcription activation domain (Gyuris, et al., 1993, Cell 75:791-803, #159). Colonies were selected, in which either of two reportergenes, LEU2 and lacZ, were activated by the cDNA-encoded proteins. Thisdouble selection scheme was used to increase the stringency, because inan initial screen a high proportion of candidates scored negative insubsequent genetic tests. The library plasmids were isolated from theselected clones.

The binding specificity of the encoded fusion proteins was tested byassaying the activation of a lacZ-reporter gene encoded on pSH18-34.Expression of β-galactosidase from this plasmid is transcriptionallycontrolled by LexA-specific operator sites. The isolated libraryplasmids were co-transformed with pLexA-NS1 or pRFHM1 into EGY40harboring pSH18-34. pRFHM1 expresses a LexA-bicoid fusion protein andwas used as a non-specific operator-binding control. The resultingstrains were assayed for β-galactosidase activity specifically on X-galplates containing galactose, but not glucose. From 3.3×10⁵ independentlibrary transformants, three plasmids were isolated that inducedgalactose-specific activation of the lacZ reporter gene only incombination with pLexA-NS1. Sequence analysis indicated that the threeplasmids were each derived from different cellular cDNAs.

7.2.2. CLONING AND SEQUENCE ANALYSIS OF NS1I-1

One of the isolated plasmids, pK5, was analyzed further. It carried acDNA-insert of 1781 bp with an open reading frame of 1413 nucleotidesfollowed by 368 nucleotides of a potentially untranslated region (FIGS.12A-12D). The cDNA terminated with an oligo(A)-tract and had a consensuspoly(A)-site at positions 2526-2531. Northern blot analysis of HeLa cellpoly(A)-RNA using a NS1I-1-specific probe detected one single transcriptof about 3.0 kb suggesting that the pK5 insert represented an incompleteCDNA (FIG. 13). The remaining NS1I-1 cDNA was cloned by a 5′RACEprocedure (Frohman, et al., 1988, Proc. Natl. Acad. Sci. USA 85:8998-9002). Four independent clones were sequenced that differed only inlength at the very 5′-end. The longest 5′RACE product, contained inpRACENS1I-1, extended the NS1I-1 sequence for 893 nucleotides upstreamtotaling in a cDNA of 2674 bp (FIGS. 12A-12D). The sequence has one longopen reading frame encoding a protein of 735 amino acids with apredicted molecular mass of 79.5 kDa and a pI of 9.06. The putativeATG-start codon is located 103 nucleotides downstream of the 5′-end andis in the context of a sequence consistent with its being atranslational start (Kozak, 1989, J. Cell Biol. 108: 229-241).

Sequence comparisons through the EMBL- and Genbank databases using theFASTA- and TFASTA-analysis programs revealed that NS1I-1 is highlyhomologous to porcine 17β-estradiol dehydrogenase (Leenders, et al.,1994, Eur. J. Biochem. 222: 221-227). The two cDNAs are 86 identical onthe nucleic acid level. The encoded proteins are 84 identical and are 92similar when allowing for conserved amino acid substitutions. NS1I-1cDNA also shows strong homology to ten human cDNA fragments that havebeen isolated as expressed sequence tags, as revealed by aBLAST-analysis of the NCBI-database (fragments are between 134 to 556 bpin length). These cDNAs were derived from different tissues includingliver, spleen, brain, adipose tissue, and adrenals tissue indicating abroad expression of NS1I-1 in the body.

The encoded NS1I-1 protein features two conserved sequence motifs of theshort-chain alcohol dehydrogenase family (Persson, et al., 1991, Eur. J.Biochem. 200: 537-543). Specifically, amino acids 15 to 22 (TGAGAGCG;SEQ ID NO:29) are similar to the potential co-factor binding site, andresidues 163 to 167 (YSAAK; SEQ ID NO:30) correspond to a short stretchthat has been suggested to participate in catalysis (Chen, et al., 1993,Biochemistry 32: 3342-3346). The presence of the tri-peptide AKL at thecarboxy-terminus was also noted. Similar tri-peptide motifs have beenfound to serve as targeting signals for import into microbodies (for areview, see de Hoop & Ab, 1992, Biochem. J. 286: 657-669). However, thepresence of this signal does not automatically direct a protein to theseorganelles (de Hoop & Ab, 1992, Biochem. J. 286: 657-669).

7.2.3. NS1I-1 BINDS NS1 PROTEIN FROM EXTRACTS OF INFLUENZA A VIRUSINFECTED CELLS

In order to confirm a physical interaction between NS1I-1 protein andNS1 expressed in influenza virus infected cells, a co-precipitationassay was performed as similarly described in Section 6.2.3, above, forNPI-1. A glutathione-S-transferase (GST)-NS1I-1 fusion gene wasconstructed and expressed in E. coli GST-NS1I-1 fusion protein frombacterial lysate was absorbed to the affinity matrix glutathione agaroseand purified from contaminating bacterial proteins. The immobilizedfusion protein was used to bind and precipitate ³⁵S-labeled proteinsfrom extracts of MDCK cells infected with human influenza A/WSN/33viruses (FIG. 14). The NS1 protein of this strain is 98 identical to itscounterpart from A/PR/8/34 used in the yeast interaction screen.Aliquots of the same extract were used to in parallel reactions toimmunoprecipitate influenza virus proteins NS1 and M1. The precipitatedproteins were analyzed by SDS-gel electrophoresis and visualized byfluorography. FIG. 14 shows, that GST-NS1I-1 efficiently precipitated aprotein band co-migrating with immunoprecipitated NS1 protein frominfected cell extract (compare lanes 2 and 3). This interaction wasspecific for NS1I-1 since no proteins were detected in precipitatesusing GST only (lane 6). In addition, no proteins were precipitated byGST-NS1I-1 from mock-infected cells (lane 8), showing that a virusinduced protein was recognized by NS1I-1. This experiment confirmed,that NS1I-1 interacts specifically with the NS1 protein of influenza Avirus.

If this interaction is important for the viral life-cycle one wouldexpect it to be conserved. Consequently, the binding of NS1I-1 to NS1proteins from other influenza A strains should be detectable despite oftheir considerable variation in the primary structure (Baez, et al.,1981, Virology 113: 397-402; Ludwig, et al., 1991, Virology 183:566-577). Therefore the interaction between NS1I-1 and NS1 was examinedusing the same co-precipitation assay described above, with extractsfrom cells infected with different influenza A and B virus strains.

Mutations accumulate in the NS1 gene at a steady rate over time(Buonagurio, et al., 1985, Science 232: 980-982). Thus, the time-spanbetween the isolation of two strains is reflected in the sequencevariation of its NS1 proteins (Ludwig, et al., 1991, Virology 183:566-577; Buonagurio, et al., 1985, Science 232: 980-982). NS1I-1 bindingto NS1 proteins from two recently isolated human influenza A strainsA/Beijing/32/92 and A/Berkeley/1/68 was examined. As can be seen inFIGS. 15C and 15D, respectively, NS1 proteins fron both strains werespecifically precipitated (FIGS. 15C and 15D, lanes “GST-K5”). A lowimmunoprecipitation efficiency of NS1 protein from the Beijing-strain(FIG. 15C) was reproducibly observed. The NS1 proteins ofAJBerkeley/1/68and A/WSN/33 are 90.8 identical to each other. The NS1 sequence ofA/Beijing/32/92 is not known.

The following analyses were conducted to examine whether GST-NS1I-1 isalso recognized by the more divergent NS1 proteins of the avianinfluenza strains Aduck/Alberta/76 and A/turkey/Oregon/71. The NS1proteins of these strains are 66.5 and 63.6 identical, respectively, toA/WSN/33. Significantly, NS1 of A/turkey/Oregon/71 is only 124 aminoacids in length, lacking most of the carboxy-terminal half of other NS1proteins, which consist of 207 to 237 amino acids (Norton, et al., 1987,Virology 156: 204-213). Nevertheless, precipitation of a protein bandco-migrating with NS1 from both strains was observed (FIGS. 15A and 15B,lanes “GST-K5”). The NS1 and M1 proteins of A/duck/Alberta/76 could notbe separated by the gel system used. Significant amounts ofnucleoprotein in the GST-NS1I-1 precipitates of these avian strains werereproducibly detected for undetermined reasons.

Finally, the co-precipitation assay was used to test the human influenzaB virus B/Lee/40. Surprisingly, GST-NS1I-1 precipitated specifically theinfluenza B virus NS1 protein, although it is only 20.6 identical to NS1from A/WSN/33 (FIG. 15, Panel E, lane “GST-K5 ”). Taken together, thebinding of GST-NS1I-1 to NS1 proteins expressed by several influenza Aand B virus stains could be demonstrated, despite the great divergenceof their primary structures. This result strongly supports an importantfunction of this interaction during the viral life cycle, and indicatesthat the NS1I-1 interaction is an excellent target for antiviralintervention.

8. EXAMPLE THE IDENTIFICATION OF NS1-BINDING PROTEIN (NS1-BP) AND ITINTERACTION WITH INFLUENZA NS1 PROTEIN

In the example described below, the yeast interactive trap system(Gyuris, et al., 1993, Cell 75: 791-803; Zervos, et al., 1993. Cell 72:223-232) was used to screen for cellular proteins that interact with theNS1 protein and to identify and characterize a human 70 kDa protein,termed NS1-binding protein (NS1-BP), which interacts with thenon-structuial NS1 protein of the influenza A virus. The geneticinteraction was confirmed by the specific coprecipitation of the NS1protein from solution by a GST-NS1-BP fusion protein and glutathioneSepharose in vitro. NS1-BP contains an N-terminal BTB/POZ domain andfive kelch-like tandem repeat elements of 50 amino acids.

In non-infected cells, affinity-purified antibodies localized NS1-BP indiscrete nuclear domains which are enriched in spliceosome assemblyfactor SC35, a pre-mRNA splicing factor. This suggests an association ofNS1-BP with the cellular splicing apparatus. However, in influenza Avirus-infected cells, NS1-BP relocalized throughout the nucleoplasm andappeared distinct from the SC35 domains which suggests that NS1-BPf3nction may be disturbed or altered. The addition of a truncated NS1-BPmutant protein inhibited pre-mRNA splicing in HeLa cell nuclear extractsin vitro, possibly as the result of a dominant-negative effect on theendogenous protein, but it did not inhibit spliceosome assembly. Theseresults suggest a role for NS1-BP in pre-mRNA splicing and support amodel in which the NS1/NS1-BP interaction has a role in mediating thesplicing-inhibitory effect of the NS1 protein.

8.1. MATERIALS AND METHODS

8.1.1. VIRUSES, CELLS AND EXTRACTS

Influenza A/WSN/33 virus was grown in the allantoic cavity of 10-day-oldembryonated chicken eggs. HeLa, Hep-2 and 293 cells were passaged inDulbecco's modified Eagle's tissue culture medium (D-MEM) containing 10fetal calf serum. For immunoblot analysis, confluent cell monolayersgrown in 35 mm dishes were lysed in RIPA buffer containing 150 mM NaCl,1.0 Nonidet P-40, 0.5 deoxycholate, 0.1 sodium dodecyl sulfate, 50 mMTris/HCl, pH8.0. Lysates were clarified by centrifugation for 10 min at13,000 g and supernatants were used for immunoblot analysis.

8.1.2. YEAST STRAINS, E. COLI STRAINS AND PLASMIDS

E. coli strains used for cloning and expression were MH3 (trpC araD lacXhsdR galU galK), DH5∝(F⁻Φ8odlacZΔM15 Δ(lacZYA−argF)U169 deoR recA1 endA1hsdR17(r_(K) ⁻m_(K) ⁺) supE44λ⁻thi-1 gyrA96 relA1), BL26 (F⁻ ompThsdS_(B) (r_(B) ⁻m_(B) ⁻) gal dcm) and XL1 Blue recA1 endA1 gyrA96 thi-1hsdR17 supE44 rel A1 lac {F′ pro AB lacIqZΔM15 TnlO (Tet )}.Saccharomyces cerevisiae EGY48 (Mata trp1 ura3 his3LEU2::pLEX-Aop6-LEU2), plasmids pSH18-34, pRFHM1 and the HeLa cell cDNAexpression library constructed in pJG4-5 were kindly provided by R.Brent (Harvard Medical School) and have been described previously(Gyuris, et al., 1993, Cell 75: 791-803; Zervos, et al., 1993, Cell 72:223-232). The constructions of plasmids pLexA-NS1, pcDNA3-NS1 andpGEX-NS1 have been described elsewhere (Wolff, et al., 1996, J. Virol.70: 5363-5372). SP6-MNX (Zilimann, et al., 1988, Mol. Cell. Biol. 8:814-821) was used as a template for transcription of synthetic pre-mRNA.Construction of plasmids followed standard cloning procedures (Ausubel,et al., 1992, Current protocols in molecular biology. John Wiley & Sons,New York). Plasmid pGEX-NS1-BP was made by subcloning the HeLa cDNA fromthe library plasmid (see below) into pGEX-SX-1 (Pharmacia). Thebacterial expression plasmids pGEX-NS1-BP-REP and pMAL-NS1-BP-REP weregenerated by inserting NS1-BP cDNA corresponding to amino acids 1-368(nucleotide positions 1 to 1104) between the EcoRI/XhoI-sites ofpGEX-5X-1 (Pharnacia) and the EcoRI/SalI-sites of pMAL-c2 (New EnglandBiolabs), respectively.

8.1.3. IDENTIFICATION AND ISOLATION OF NS1-INTERACTING cDNA CLONES USINGTHE YEAST INTERACTION TRAP

The yeast interaction trap was used to identify and to isolate HeLa cellcDNAs encoding NS1 binding factors as was previously described (Wolff,et al., 1996, J. Virol. 70: 5363-5372). In brief, EGY48 was transformedwith the bait plasmid pLexA-NS1 and the lacZ reporter plasmid pSHl18-34.Subsequently, this strain was transformed with a plasmid library, inwhich HeLa cell cDNAs were conditionally expressed as fusions with anacidic activation domain from a GAL1 promoter. 3.3×10⁵ primarytransformants were screened for interaction as determined by theirability to grow on minimal synthetic medium in the absence of leucineand to activate the lacZ reporter gene specifically on plates containinggalactose but not glucose. The library plasmid p59-1 was isolated fromone selected clone by transformation in E. coli MH3 as describedelsewhere (O'Neill, et al., 1994, Virology 206: 116-125). Thespecificity of the interaction was examined by retransformation of p59-1into EGY48-harbouring pSH18-34 together with pLexA-NS1 or with pRFHM1,which expresses an unrelated fusion of LexA with the bicoid protein ofD. melanogaster. p59-1 activated the lacZ reporter gene specifically inthe presence of galactose in combination with pLexA-NS1, but not withpRFHM1. p59-1 was sequenced using a standard chain termination protocol.

8.1.4. CLONING OF NS1-BP 5′-END cDNA

cDNA corresponding to the 5′-end of NS1-BP mRNA was obtained by a5′-RACE (rapid amplification of cDNA ends) procedure using a 5′RACE kit(Gibco-BRL). 2.5 pmol of the specific DNA oligonucleotide 59GSP1(dCATTCCTCTCTGTTATAGCC (SEQ ID NO:31), corresponding to positions 1123to 1142 of NS1-BP cDNA) was hybridized to 1 μg of HeLa poly(A)⁺-RNA toprime first strand cDNA synthesis by M-MLV reverse transcriptase. ThecDNA was tailed with dC using terminal transferase. The product was usedas a template to amplify double stranded cDNA by PCR with the nestedprimer 59GSP2 (dCCACCTGCAGCTATGAG (SEQ ID NO:32), positions 1108 to1124) and the 5′RACE anchor primer. The resulting product was subclonedinto pGEM-T (Promega) to generate pGEM-NS1-BP-5′RACE plasmids. TheNS1-BP cDNA was sequenced by the standard dideoxy method.

8.1.5. NORTHERN BLOT ANALYSIS

1 μg of HeLa cell poly(A)⁺-RNA was electrophoresed on a 1agarose-formaldehyde gel, transferred onto a Nytran (Amersham) nylonmembrane and immobilized by UV-crosslinking. A ³²P-labeledNS1-BP-specific probe comprising positions 1038 to 2215 was used todetect NS1-BP MRNA by hybridization as described (Ausubel, et al., 1992,Current protocols in molecular biology. John Wiley & Sons, New York).

8.1.6. CO-PRECIPITATION OF NS1 PROTEIN WITH GST-NS1-HP BY GLUTATHIONESEPHAROSE

NS1-BP (amino acids 347-619) was expressed from pGEX-NS1-BP as aglutathione-S-transferase (GST) fusion protein in E. coli BL26.Synthesis of GST-NS1-BP was induced by addition of 1 mMisopropyl-β-D-galactopyranoside (IPTG). Bacterial cell lysate containingthe GST-NS1-BP fusion protein was adsorbed to glutathione Sepharose(Pharmacia) according to the protocol supplied by the manufacturer.Contaminating proteins were removed by three washes withphosphate-buffered-saline (PBS). NS1 protein was synthesized and labeledwith [³⁵S]-methionine in coupled 50 μl transcription/translationreactions (Promega-TNT, Promega) programmed with pcDNA3-NS1. Thetranslation reaction was mixed with 10 μl coated glutathione Sepharosebeads in 750 μl of HN100 buffer (20 mM Hepes, pH8.0, 100 mM NaCl, 0.01Nonidet P40 [NP-40]) for 2 h at 4° C. The beads were washed three timeswith PBS/ 0.01 NP-40 and the precipitated proteins were separated by SDSgel electrophoresis and visualized by autoradiography.

8.1.7. ANTI-NS1-BP-SERUM AND IMMUNOBLOT ANALYSES

The GST-NS1-BP-REP fusion protein carrying amino acids 1-368 of NS1-BPwas expressed in E. coli BL26 transformed with pGEX-NS1-BP-REP andaffinity-purified on glutathione Sepharose resin (Pharmacia) asrecommended by the manufacturer. A six month old female rabbit wasimmunized with 400 μg purified GST-NS1-BP-REP fusion protein in completeFreund's adjuvant followed by booster injections of 250 μg fusionprotein in incomplete adjuvant at a four week interval NS1-BP-specificantibodies were purified from serum by affinity-chromatography using anantigen-resin. For the construction of this matrix, a MAL-NS1-BP-REPfusion protein in which the maltose-binding protein of E. coli was fusedto amino acids 1-368 of NS1-BP was expressed in E. coli XL1-Blue cellsand affinity-purified on an amylose affinity column (New EnglandBiolabs). The MAL-NS1-BP-REP fusion protein was immobilized onCNBr-activated Sepharose (Pharmacia) and the resulting resin was usedfor the affinity-purification of NS1-BP-specific antibodies as describedelsewhere (Harlow, et al., 1988, Antibodies: A Laboratory Manual, ColdSpring Harbor Laboratory, Cold Spring Harbor, N.Y.). The purifiedantibodies were diluted 1:200 for immunoblot experiments using standardprocedures (Harlow, et al., 1988, Antibodies: A Laboratory Manual, ColdSpring Harbor Laboratory, Cold Spring Harbor, N.Y.).

8.1.8. INDIRECT IMMUNOFLUORESCENCE MICROSCOPY

HeLa cells were grown to 50 confluency on glass cover slips in D-MEMcontaining 10 fetal calf serum. Where indicated, cells were infected ata multiplicity of 10 with influenza A/WSN/33 virus diluted in PBS forone hour at 37° C. Infection was continued under tissue culture mediumat 37° C. Cells were processed for immunofluorescence analysis byfixation in 2.5 methanol-free formaldehyde (Polysciences Inc.) dilutedin PBS and permeabilization of cells in 0.1 Triton X-100 was done asdescribed (Wolff, et al., 1996, J. Virol. 70: 5363-5372). Cells werestained with primary antibodies diluted in PBS-3 bovine serum albumin.Affinity-purified anti-NS1-BP antibodies and the NS1-specific monoclonalantibody IA7 (a kind gift of Jonathan Yewdell, National Institutes ofHealth) were used at 1:100 dilutions. The anti-SC35 antibody (Fu, etal., 1990, Nature 343: 437-441) was purchased from Pharmingen Inc. andused at a dilution of 1:1000. The cells were washed and incubated withfluorescein isothiocyanate (FITC)-conjugated sheep anti-rabbitimmunoglobulin G (IgG) and/or Texas Red-conjugated goat anti-mouse IgG.Subsequently, the coverslips were washed and mounted in MOWIOL 4-88(Calbiochem) as described (Harlow, et al., 1988, Antibodies: ALaboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor,N.Y.). For conventional immunofluorescence analysis, cells were viewedon a Zeiss Axiovert 100 fluorescence microscope using a 63× objectiveand photographs were captured by a CF8/lO× video camera (Kappa GmBH). AZeiss LSM 410 Invert microscope equipped with a 100× objective lens wasused for confocal laser scanning microscopy. Digitized images werepseudocolored using PHOTOSHOP® software (Adobe Systems Inc.).

8.1.9. SPLICEOSOME ASSEMBLY AND SPLICING OF A ³²P-LABELLED PRE-mRNA INTHE PRESENCE OF GST PROTEINS

GST GST-NS1 and GST-NS1-BP fusion proteins were expressed in E. coliBL26 and affinity-purified on glutathione Sepharose (Pharmacia) columnsas recommended by the manufacturer. GST proteins were eluted with 20 mMglutathione in 50 mM Tris/HCI, pH 8.0, dialyzed versus buffer D (20 mMHepes, pH 8.0, 100 mM KCl, 20 glycerol, 0.2 nM EDTA, 0.5 mM DTT) andstored at −80° C. The purity of the prepared proteins was tested by SDSgel electrophoresis and staining by Coomassie Blue. HeLa cell nuclearextract was prepared as described (Dignam, et al., 1983, Nucleic AcidsRes. 11: 1475-1489). ³²P-labelled MINX pre-mRNA was synthesized asdescribed (Wolff, et al., 1992, EMBO J. 11: 345-359). In a typicalsplicing reaction 4 ng of pre-mRNA were incubated in a 100 μl volumecontaining 40 HeLa cell nuclear extract, 3.2 mM MgCl₂, 0.5 mM ATP, 20 mMcreatine phosphate, 60 mM KCl. 8 μg of GST or equimolar amounts of GSTfusion proteins were added where indicated and the reactions wereincubated at 30° C. The formation of splicing complexes was analyzedafter treatment with heparin (1 mg/ml) by electrophoresis on nativeacrylamide/agarose gels (Nelson, et al., 1988, Genes Dev. 2: 319-329).For RNA analysis, splicing products were purified and analyzed byelectrophoresis on denaturing 13 acrylamide-urea gels.

8.1.10. SEQUENCE COMPARISONS

The NS1-BP cDNA and its derived amino acid sequence were compared to theGenBank and EMBL databases using the FASTA and TFASTA software(Deveraux, et al., 1984, Nucleic Acids Res. 12: 387-395). The PILEUP andPRETTY programs of the Genetics Computer Group (University of Madison,Wisconsin) were used to align the repeat elements of NS1-BP and tocreate a consensus sequence.

8.2. RESULTS

8.2.1. ISOLATION OF NS1 BINDING FACTORS

The yeast interaction trap system (Gyuris, et al., 1993, Cell 75:791-803: Zervos, et al., 1993, Cell 72: 223-232) was used to identifycellular proteins that bind to the NS1 protein of the influenza A virus(Wolff, et al., 1996, J. Virol. 70: 5363-5372). A constitutivelyexpressed LexA-NS1 fusion protein was used to screen a HeLa cell cDNAplasmid library, in which cDNA-encoded proteins were conditionallyexpressed as translational fusions with an acidic activation domain froma GAL1 promoter. Expression of the acidic domain fusion proteins isinduced in the presence of galactose and repressed by glucose. 3.3×10⁵primary yeast transfoimants were screened for the galactose-dependentactivation of LEU2 and lacZ reporter genes, which are regulated byLexA-specific operator sites. Three library plasmids were isolated fromselected transformants that reproduced the interacting phenotype uponretransformation with pLexA-NS1, but not with the control plasmidpRFHM 1. The analysis of the human cDNA isolated through one of theselibrary plasmids, p59-1, which encodes a novel human protein, NS1-BP isdescribed in the following subsections.

8.2.2. CLONING AND ANALYSIS OF NS1-BP cDNA

p59-1 had a 1.2 kb cDNA insert containing one long open reading frame of819 bp followed by 338 bp of an untranslated region that terminated in arun of 20 adenosines (FIGS. 16A-16C). Northern blot analysis of HeLacell poly(A)-RNA was used to determine if the size of the isolated HeLacDNA corresponded to a complete copy of NS1-BP mRNA. A ³²P-labeledNS1-BP-specific probe hybridized mainly to an RNA species ofapproximately 3.1 kb in size (FIG. 17). This result suggested that p59-1carried an incomplete copy of NS1-BP mRNA. A 5′RACE (rapid amplificationof cDNA ends) procedure to generate cDNA derived from the 5′- end ofNS1-BP mRNA was employed. The RACE-products were subcloned and sixresulting plasmid clones were isolated and sequenced. The longest 5′RACEclone extended the NS1-BP cDNA to a total of 2752 bp (FIGS. 16A-16C).Sequence analysis revealed the presence of one long open reading frameof 1857 nucleotides that encodes a 619 amino acid protein with apredicted molecular mass of 69.7 kDa. The initiator ATG codon of theopen reading frame is in a sequence context which is compatible withbeing a translational start site (Kozak, 1989, J. Cell Biol. 108:229-241). Analysis of the sequence of NS1-BP revealed the presence offive imperfect repeat elements of47-49 amino acids at the C-terminalregion between amino acids 368 to 607 (FIG. 18). These tandem repeatsare 18 to 41 identical to each other and five positions are invariantbetween domains.

The cDNA and the deduced amino acid sequence of NS1-BP were compared tosequences in the GenBank and EMBL databases using the FASTA and TFASTAalgorithms (Deveraux, et al., 1984, Nucleic Acids Res. 12: 387-395). Tworegions of NS1-BP were identified which had homology to other proteins.First, the N-terminal ˜120 amino acids of NS1-BP are homologous to theBTB (bric-a-brac, tramtrack, broad complex)/POZ (poxviruses and zincfingers) domain that was identified in several zinc finger proteinsknown to act as transcriptional regulators (Bardwell, et al., 1994,Genes Dev. 8: 1664-1677; Zollmann, et al., 1994, Proc. Natl. Acad. Sci.USA 91: 1071-7-10721). Second, the five tandem repeats located betweenNS1-BP residues 368 and 607 are homologous to the 50 amino acid kelchmotif that was originally found in the Drosophila Kelchprotein (Bork, etal., 1994, J. Mol. Biol. 236: 1277-1282; Xue, et al., 1993, Cell 72:681-693). The Kelch protein is a component of the intercellular ringcanals in the Drosophila egg chamber. Its function is required for thedevelopment of fertile oocytes since mutations in the kelch gene cancause a sterile phenotype in females (Xue, et al., 1993, Cell 72:681-693). Interestingly, the Kelch protein also contains a predictedBTB/POZ domain. In total, the NS1-BP is 31 identical in amino acidsequence to Kelch.

Several other proteins were identified which have both kelch and BTB/POZdomains. These include the murine ENC-1 protein which is specificallyexpressed in the nervous system (Hernandez, et al., 1997, J. Neurosci17: 3038-3051), human and bovine calicin, components of the mammaliansperm head (von Bülow, et al., 1995, Exp. Cell. Res. 219: 407-413), thepredicted product of the human KIAAO132 gene (Nagase, et al., 1995, DNARes. 2:167-174) and the proteins encoded by genes of vaccinia virus(A55R, C2L, and F3L) (Goebel, et al., 1990, Virology 179: 247-266), theShope fibroma virus (T6, T8 and T9) (Upton, et al., 1990, Virology 179:618-631), variola major virus (D16L, C7L, J6R, B20R) (Massung, et al.,1994, Virology 201: 215-240) and swine pox virus (C4L, C13L) (Massung,et al., 1993, Virology 197: 511-528). The functions of the viral geneproducts are not known. Several cellular kelch-repeat proteinscontaining no BTB/POZ domains were found, including the α- and β-scruinproteins which are expressed in the sperm of the horseshoe crab L.polyphemus (Way, et al., 1995, J. Cell Sci. 108: 3155-3162; Way, M., etal., 1995. J. Cell Biol. 128: 51-60), the products of the mouseintracistemal A particle-promoted placenta (MIPP) gene (Chang-Yeh, etal., 1991, Nucl. Acids Res. 19: 3667-3672) and the spe26 gene of C.elegans (Varkey, et al., 1995, Genes Dev. 9:1074-1086).

8.2.3. THE NS1 PROTEIN BINDS TO NS1-BP IN VITRO

To confirm the interaction of NS1 and the NS1-BP in vitro, bindingassays were erformed. NS1-BP cDNA isolated through the library plasmidin the interaction trap screen (corresponding to NS1-BP amino acids347-619) was fused to the glutathione-S-transferase (GST) gene in abacterial expression vector. GST-NS1-BP fusion protein was expressed inE. coli and adsorbed to glutathione Sepharose beads. As a control,glutathione Sepharose eads were prepared that were complexed with GSTprotein alone. The NS1 protein was synthesized in vitro and labeled with³⁵S-methionine through coupled transcription/translation reactions inreticulocyte lysates. The coated glutathione Sepharose beads wereincubated with the radiolabeled NS1 protein. The NS1 protein wasefficiently precipitated by the GST-NS1-BP fusion protein, but not byGST (FIG. 19, lanes GST, GST-NS1-BP). This result confirms the yeast twohybrid data and shows that the viral NS1 protein can also physicallyinteract with the cellular NS1-BP.

8.2.4. NS1-BP IS CONCENTRATED IN INTRANUCLEAR DOMAINS ENRICHED INSPLICING FACTORS

Polyclonal rabbit antibodies were raised against recombinant NS1-BP andused to analyze the concentration and intracellular localization ofNS1-BP in mammalian cells. Immunoblot analyses of the humanepithelial-derived Hep-2, 293 and HeLa cell lines by NS1-BP-specificantibodies detected a protein doublet band with a molecular mass ofabout 70 kDa (FIG. 20). This is the predicted size for a protein derivedfrom the NS1-BP open reading frame. Two minor protein bands migrating at65 kDa and 50 kDa were stained at variable intensity and may correspondto NS1-BP break-down products. NS1-BP-specific antibodies wereaffmity-purified from immune serum and used for immunofluorescenceanalysis. In HeLa cells, a punctate nuclear staining pattern thatexcluded the nucleoli was observed (FIG. 21). In addition, a weakdiffuse staining of the cytoplasm was reproducibly seen. The nuclearstaining of NS1-BP was similar in appearance to the ‘speckled’ patternthat was obtained by immunofluorescence staining of cells withantibodies directed against factors involved in pre-mRNA splicing(Spector, et al., 1991, EMBO J. 10: 3467-3481). The speckle domainscorrespond ultrastructurally to interchromatin granules andperichromatin fibrils and are enriched in splicing snRNPs and non-snRNPsplicing factors like SC35 and other SR proteins (reviewed by Spector,1993, Annu. Rev. Cell Biol. 9: 265-315). To determine how the NS1-BPlocalization relates to speckle domains HeLa cells were doubleimmunostained with NS1-BP-specific antibodies and a monoclonal antibodyraised against the spliceosome assembly factor SC35 which is a knowncomponent of speckle domains (Fu, et al., 1990, Nature 343: 437441).Confocal laser scanning microscopy demonstrated that the dot-likenuclear NS1-BP signal colocalized with the SC35 signal (FIG. 22 A-C).The concentration of multiple proteins involved in pre-mRNA rocessing inthese nuclear regions suggests an important role of this compartment forcellular RNA biogenesis (Singer, et al., 1997. Cell 91: 29.1-294). Theaccumulation of NS1-BP in the same compartment suggests that NS1-BP maybe a component of the cellular splicing machinery.

8.2.5. NS1-BP RELOCALIZES TO THE ENTIRE NUCLEOPLASM IN INFLUENZA AVIRUS-INFECTED CELLS

The viral NS1 protein accumulates in the nucleus of cells infected withinfluenza A virus (Greenspan, et al., 1988, J. Virol. 62: 3020-3026,Young, et al., 1983, Proc. Natl. Acad. Sci. USA 80: 6105). Hence,experiments were performed that examined whether the speckled nuclearlocalization of the cellular NS1-binding protein would be affected invirus-infected cells expressing the NS1 protein. Influenza Avirus-infected cells were double immunostained with antibodies directedagainst the NS1-BP and the viral NS1 or the cellular SC35 protein,respectively (FIGS. 22A-22I). As expected, the NS1 protein localizedpredominantly to the nucleoplasm with some additional nucleolar signal(FIG. 22E). For the NS1-BP staining, a remarkable change was observedafter infection by influenza virus. The cellular NS1-BP was no longerfound concentrated in the nuclear speckles, but was instead distributedthroughout the nucleoplasm. Its distribution pattern was similar to thatof the viral NS1 protein except that there was no nucleolar signal(FIGS. 22D and 22F). This redistribution of NS1-BP was observed in a fewcells as early as four hours post infection. With ongoing infection,most of the cells expressing the viral NS1 protein had an NS1-BPstaining pattern similar to the one shown in FIG. 22D. The intensity ofthe nuclear NS1-BP signal appeared to increase slightly in infectedcells. However, no increase in the amount of NS1-BP in virus-infectedcells by immunoblotting (data not shown) was detected. This suggeststhat NS1-BP epitopes are more easily accessible to antibodies in thenuclei of infected cells.

The intranuclear relocalization of NS1-BP in infected cells raised thequestion of whether the distribution of other proteins that normallylocalize to speckles would also change. Gross redistribution of proteinsmight occur if speckles break down during influenza virus infection.However, the staining of virus-infected cells with anti-SC35 antibody at10 hours post infection (FIG. 22H) showed only a small change of thenormal pattern. The average size of the speckles appeared to be slightlydecreased with a concomitant increase in the number of these domains.Essentially the same observation was made in a previous study thatexamined the distribution of splicing factors in influenza Avirus-infected cells (Fortes, et al., 1995, J. Gen. Virol. 76:1001-100). These findings suggest that the redistribution of NS1-BP tothe nucleoplasm in infected cells is not merely a consequence of speckledisintegration. The cellular NS1-BP may therefore be relocalized via thebinding to the viral NS1 protein. The intracellular relocalization islikely to have an impact on NS1-BP finction in virus-infected cells.

8.2.6. A TRUNCATED NS1-BP PROTEIN INHIBITS PRE-mRNA SPLICING AT A STEPAFTER SPLICEOSOME ASSEMBLY

It has been demonstrated that the NS1 protein can inhibit pre-mRNAsplicing in vivo and in vitro (Fortes, et al., 1994, EMBO J. 13:704-712; Lu, et al., 1994, Genes Dev. 8:1817-1828). The block insplicing was assigned to a step after the assembly of spliceosomes, butbefore the first catalytic event (Lu, et al., 1994; Genes Dev.8:1817-1828). It was hypothesized that the binding of the NS1 protein toa cellular protein(s) whose function is essential for splicing, leads tothe observed block in splicing. Given the intranuclear colocalization ofthe NS1-binding protein with well-known factors of the mRNA splicingapparatus, the role of NS1-BP in pre-mRNA splicing was analyzed throughthe use of in vitro splicing assays using HeLa cell nuclear extracts. Atruncated NS1-BP in form of a GST-NS1-BP fusion protein was used as apotential dominant-negative inhibitor of the endogenous NS1-BP. Such astrategy has previously been employed to examine a functional role of aprotein in RNA splicing (Yuryev, et al., 1996, Proc. Natl. Acad. Sci.USA 93: 6975-6980).

The splicing of a ³²P-labeled pre-mRNA in HeLa cell nuclear extracts wasanalyzed in the presence of GST-NS1-BP, GST-NS1 or control GST protein(FIG. 23). The formation of heparin-resistant splicing complexes in thesame reactions by native gel electrophoresis was also examined. In thecontrol reaction, the intron lariat-exon2 splicing intermediate waseasily detected after a one hour incubation. At the two hour time pointthe accumulation of spliced MRNA and the intron lariat was observed(FIG. 23A, lanes NE). Native gel electrophoresis showed that both A- andB-type spliceosomes were formed normally at 20, 40 and 60 minutes with ahigher proportion of radiolabeled mRNAs shifting into B complex bands atlater time points (FIG. 23B, lanes NE). The A complex contains U2 snRNPand the B complex, which represents the fully assembled spliceosome, theU2, U4/U6 and US snRNPs (Konarska, et al., 1987, Cell 49: 763-774). Theaddition of GST protein to nuclear extract did not change the splicingof the pre-mRNA nor did it interfere with the assembly of spliceosomes(FIG. 23A and B, lanes GST). However, no splicing intermediates orproducts accumulated in the presence of equimolar amounts ofaffinity-punfied GST-NS1 protein (FIG. 23A, lanes GST-NS1). This effectwas not due to a defect in spliceosome assembly, because both A- andB-type complexes assembled, although B bands formed at a slightlyreduced rate (FIG. 23B, lanes GST-NS1). Interestingly, an almostidentical result was obtained in splicing reactions complemented withthe same concentration of purified GST-NS1-BP. There were no splicingproducts detectable after one hour incubation and only trace amounts ofthe exon 1 and intron lariat-exon2 splicing intermediates were detectedat the two hour time point (FIG. 23A and B, lanes GST-NS1-BP). The RNPgel analysis showed that the formation of B-type splicing complexesoccurred in the presence of GST-NS1-BP (FIG. 23B, lanes GST-NS1-BP andGST-NS1). This result demonstrates that the truncated NS1-BP proteinblocks a cellular activity required for the first step of pre-mRNAsplicing as does the viral NS1 protein. The observed splicing inhibitionby the truncated NS1-BP is likely to be the result of adominant-negative effect. on the splicing function of the endogenouswild-type protein.

8.3. DISCUSSION

The NS1 is the only non-structural protein of influenza A viruses thatis expressed in infected cells. Although the NS1 protein has beenimplicated in several different processes including pre-mRNA splicing,mRNA transport and translation, little is known about specific cellularfactors that are recognized by NS1. Since NS1 has pleiotropic effects,it may interact with a variety of proteins in infected cells therebyaffecting different steps of cell metabolism. For example, NS1 appearsto be an auxiliary (virulence) factor that plays a crucial role ininhibiting interferon-mediated antiviral responses of the host.Genetically engineered influenza A virus lacking the NS1 gene showimpaired ability to replicate in normal host cells with functionalinterferon-defense systems, but are able to replicate ininterferon-deficient host cells. The role of NS1 in theinterferon-mediated response makes NS1/host cell protein interactionsexcellent targets for therapeutic intervention.

In order to identify such cellular proteins, the yeast interaction trapwas used, as described supra, to screen a human CDNA expression libraryusing a LexA-NS1 fusion protein as bait. NS1-I (NS1-interactor) which isa cytoplasmic 55 kDa protein that binds to the divergent NS1 proteinsexpressed by influenza A and B viruses has been previously characterized(Wolff, et al., 1996, J. Virol. 70: 5363-5372). NS1-I is derived fromthe precursor protein of a 17β-estradiol dehydrogenase and its bindingto NS1 may have a function in modulating steroid hormone levels invirus-infected cells (Wolff, et al., 1996. J. Virol. 70: 5363-5372).This example demonstrates the identification ofthe NS1-binding protein,NS1-BP, through its specific interaction with the NS1 protein. Thisinteraction was confirmed by the use of an in vitro binding assay. TheNS1 protein co-precipitated with a GST-NS1-BP fusion protein, but notwith GST alone demonstrating that NS1 also physically binds to NS1-BP.

The analysis of the primary structure of the NS1-BP identified tworegions with considerable homology to known proteins. The amino-terminal120 amino acids of NS1-P are homologous to the BTB/POZ domain originallyidentified in a group of proteins that primarily regulate transcription(Bardwell, et al., 1994, Genes Dev. 8: 1664-1677; Zollmann, et al.,1994, Proc. Natl. Acad. Sci. USA 91: 10717-10721). This includes thehuman proto-oncogenes LAZ3/BCL6 and PLZF and the Drosophila Tramtrack,GAGA and Broad Complex proteins (for a review, see Albagli, et al., 1995Cell Growth Diff. 6:1193-1198). It has been shown that the isolatedBTB/POZ domains of bric a brac (bab), ZID, LAZ3/BCL6 and Kelch canmediate homo- and/or heterodimerization suggesting that BTB/POZ domainsare a conserved protein-protein interaction motif (Bardwell, et al.,1994, Genes Dev. 8: 1664-1677; Chen, et al., 1995, Mol. Cell. Biol. 15:3424-3429; Dhordain, et al., 1995, Oncogene 11: 2689-2697; Dong, et al.,1996, Proc. Natl. Acad. Sci. 93: 3624-3629; Robinson, et al., 1997, J.Cell. Biol. 138: 799-810). In the case of the bab protein, the first 51amino acids of the BTB/POZ domain were found to be sufficient fordimerization (Chen, et al., 1995, Mol. Cell. Biol. 15: 3424-3429).Another function of this module may be protein targeting to specificnuclear domains, since the appearance of ZID, LAZ3/BCL6 and hZF5proteins in ‘nuclear dots’ depended on the integrity of their BTB/POZdomains (Bardwell, et al., 1994, Genes Dev. 8: 1664-1677; Chen, et al.,1995, Mol. Cell. Biol. 15: 3424-3429; Dhordain, et al., 1995, Oncogene11: 2689-2697; Dong, et al., 1996, Proc. Natl. Acad. Sci. 93: 3624-3629;Sugiura, et al., 1997, Biochem. Biophys. Acta 1352: 23-26).

The human CDNA isolated through the interaction trap encoded amino acids347 to 619 of NS1-BP, which suggests that this region contains thebinding site for the NS1 protein. This part of the NS1-BP almostentirely consists of five imperfect repeats of 47-49 amino acids thatare homologous to the kelch repeat motif (Bork, et al., 1994, J. Mol.Biol. 236: 1277-1282). Based on phylogenetic sequence comparisons it wassuggested that kelch repeats take on a conserved three-dimensional foldthat was previously identified in procaryotic and eucaryotic enzymes(Bork, et al., 1994, J. Mol. Biol. 236: 1277-1282). A high resolutionx-ray diffraction analysis for one of these enzymes, galactose oxidaseof D. dendroides, revealed that each repeat element folds into ablade-like domain of four-stranded antiparallel β sheets. The blade-likedomains are circularly arranged resulting in a β propeller structure(Ito, et al., 1991, Nature 350: 87-90). The sequence homology suggeststhat the five repeats of NS1-BP may also adopt a similarthree-dimensional fold.

In spite of the conservation on the sequence level, kelch repeats appearto have diverged functions in the homologous proteins. In galactoseoxidase of D. dendroides, the kelch repeat fold contains the catalyticcenter of the enzyme (Ito, et al., 1991, Nature 350: 87-90). On theother hand, the kelch elements of the β-scruin protein of L. polyphemushave been shown to bind to actin which lead. to the proposal, that kelchrepeats may constitute an actin-binding domain (Schmid, et al., 1994, J.Cell. Biol. 124: 341-350; Way, et al., 1995, J. Cell Biol. 128: 51-60).However, other proteins that contain kelch repeats like the α-scruin orthe calicin proteins were localized to intracellular regions that appearto be devoid of actin (von Bülow, et al., 1995, Exp. Ceil. Res. 219:407-413; Way, et al., 1995, J. Cell Sci. 108: 3155-3162).

The proteins encoded by different. poxviruses that are homologous toNS1-BP have not been studied. The genes of the vaccinia virus A55R, C2Land F3L ORF products could be deleted without affecting viralreplication in tissue culture and are therefore considered to benon-essential (Kotwal, et al., 1988, Virology 167: 524-537; Perkus, etal., 1991, Virology 180: 406-410). However, the presence of homologousproteins in different poxviruses argues for important roles of theseproteins. For example, these gene products may increase virus virulenceor otherwise play a role in infected animals.

By immunolocalization studies described in this example demonstrate thatthe NS1-BP is concentrated in discrete regions in the nucleus ofnon-infected cells. This intracellular distribution is compatible with afunction of the NS1-BP in gene regulation. Confocaldouble-immunostaining analyses of cells has previously demonstrated thatthe NS1-BP co-localizes in a speckled pattern with the spliceosomeassembly factor SC35 (Fu, et al., 1990, Nature 343: 437-441). Severalimmunolocalization studies have shown that a number of other factorsinvolved in pre-mRNA splicing, among them the spliceosomal snRNPs, alsoaccumulate in the 20 to 50 irregularly shaped SC35 domains termed‘speckles’ (reviewed by Spector, D. L. 1993. Annu. Rev. Cell Biol. 9:265-315). As shown by electron microscopic analysis, the speckle domainscorrespond to interchromatin granules and perichromatin fibrils (Fakan,1994, Trends Cell Biol. 4: 86-90; Spector, et al., 1991, EMBO J. 10:3467-3481). Different conclusions have been drawn about the functionalsignificance of the accumulation of splicing factors in specificsubnuclear compartments. Since speckle domains localize near genes thatare transcribed and spliced, it was suggested that speckles constitute acompartment in which pre-mRNA is actively spliced (Xing, et al., 1995,J. Cell Biol. 6:1635-1647). However, nascent RNA polymerase IItranscripts were detected by Br-UTP labeling in a random distributionthroughout the nucleoplasm (Fay, et al., 1997, Exp. Cell Res. 231:27-37). Since splicing is thought to occur co-transcriptionally it wasconcluded by this group that pre-mRNA is processed throughout thenucleoplasm. For the speckle domains, a role as a storage or recyclingcompartment that supplies splicing factors to transcription sites wasalso proposed (reviewed by Singer, et al., 1997, Cell 91: 291-294). Inany case, the important role of speckle domains for cellular RNAbiogenesis is emphasized by their dynamic appearance in response toalterations of cellular gene expression. Stress conditions like heatshock that result in inhibition of RNA splicing also induce apparentchanges in the distribution of splicing factors (Bond, 1988, EMBO 7:3509-3518; Spector, etal., 1991, EMBO J. 10: 3467-3481). Thelocalization of the NS1-BP in nuclear regions that contain lughconcentrations of pre-mRNA splicing factors suggests a role for theNS1-BP in mRNA splicing.

The intranuclear localization of the NS1-BP was drastically-altered ininfluenza A virus-infected cells that expressed the NS1 protein. Thespeckled pattern was replaced by a rather homogenous distribution ofNS1-BP throughout the nucleoplasm in a fashion similar to that observedfor the viral NS1 protein. In contrast, only subtle changes weredetected in the appearance of the SC35 protein in influenza Avirus-infected cells. The relocalization of the NS1-BP is thereforeapparently a specific effect and not the result of a disintegration ofthe SC35-enriched domains. This idea is further supported byobservations that stress conditions like heat-shock, serum starvation orthe addition of actinomycin D which inhibits RNA polymerase IItranscription, did not disrupt the colocalization of NS1-BP with SC35.Local changes in NS1-BP concentration in response to an influenza Avirus infection may also influence its function.

The NS1 protein has previously been shown to inhibit pre-mRNA splicingin vitro and in vivo (Fortes, et al., 1994, EMBO J. 13: 704-712; Lu, etal., 1994, Genes Dev. 8:1817-1828). It was speculated that theinhibition of splicing would result in the retention of pre-mRNA in thenucleus of infected cells thereby increasing the concentration of MRNAcap structures available for cap-snatching by the viral RNA polymerase(Lu, et al., 1994, Genes Dev. 8:1817-1828). Alternatively, the activityof the NS1 protein may contribute to the observed regulated splicing ofthe viral mRNAs derived from segments 7 and 8 (Smith, et al., 1985, EMBOJ. 4: 2313-2319; Valcarel, et al., 1991, J. Gen. Virol. 72: 1301-1308).Because the cellular NS1-BP is concentrated in nuclear regions enrichedin pre-mRNA splicing factors and it relocalizes in virus-infected cells,the role of NS1-BP in pre-mRNA splicing in vitro examined. A truncatedNS1-BP was used as a potential non-functional competitor of theendogenous protein in HeLa cell nuclear extract and the effects of thisprobe were compared to the known inhibition of pre-mRNA splicing by theNS1 protein. A similar experimental design has been used before byothers to examine the role of the large subunit of RNA polymerase II inpre-mRNA splicing (Du, et al., 1997, J. Cell Biol. 136: 5-18; Yurev, A.,et al., 1996. Proc. Natl. Acad. Sci. USA 93: 6975-6980). This exampledemonstrates that the truncated NS1-BP blocks the splicing of a³²P-labeled pre-mRNA in HeLa cell nuclear extracts at the same step asdoes the NS1 protein. Splicing complexes formed at only slightly reducedrates in the presence of each of the two proteins. However, theconversion of the pre-mRNA into splicing intermediates or products washighly reduced. This finding suggests that both proteins act on the samestage of the spliceosome pathway, i.e., they block an activity requiredfor the first catalytic step. The shortened NS1-BP that was used lacksthe 346 N-terminal amino acids of the wild-type protein and was fused tothe 26 kDa GST protein. This mutant NS1-BP protein is therefore unlikelyto retain the full activity of the wild-type protein. However, thetruncated NS1-BP may still be able to interact with other essentialsplicing factors thereby preventing their association with the wild-typeNS1-BP. These results are compatible with a role of the wild-type NS1-BPin pre-mRNA splicing.

In normal cells, NS1-BP is concentrated in intranuclear domains that areenriched with multiple splicing factors. This example demonstrates thatthe cellular NS1-BP is specifically relocalized in influenza Avirus-infected cells that express the NS1 protein. Redistribution ofNS1-BP is likely to alter its function or activity. The influenza Avirus may thus inhibit host cell splicing in infected cells by theassociation of the viral NS1 protein with the cellular NS1-BP. The NS1protein may either downregulate NS1-BP activity directly by blocking itsnormal association with spliceosomes. Alternatively, a mechanism can beenvisioned, in which the viral NS1 protein removes the cellular NS1-BPfrom centers of active splicing thereby lowering its availability forparticipation in cellular mRNA splicing processes. In both models, therelocalization of NS1-BP may reflect its disrupted function. It haspreviously been suggested that the NS1 protein inhibits splicing bybinding to U6 snRNA (Lu, et al., 1994, Genes Dev. 8:1817-1828; Qiu, etal., 1995, RNA 1: 304-316) which is a key component of the catalyticcore within the spliceosome (Guthrie, 1991, Science 253: 157-163; Sharp,1994, Cell 77: 805-815). The apparent inhibition of splicing by the NS1protein through the binding to NS1-BP does not exclude an NS1-U6interaction. It is estimated that at least 80-100 different factors areinvolved in pre-mRNA splicing (Green, 1991, Annu. Rev. Cell Biol. 7:559-599; Sharp, 1994, Cell 77: 805-815). For that reason it has beendifficult to dissect their dynamic and complex interactions during theassembly of a spliceosome and the catalysis of the splicing reaction. Itis possible that interactions of the NS1 protein with both NS1-BP and U6snRNA contribute to the inhibition of pre-mRNA splicing.

Like the influenza A virus NS1 protein, the essential ICP27 protein ofherpes simplex virus type 1 (HSV-1) has been implicated in impairingcellular pre-mRNA splicing, possibly as part of a host cell shut-offmechanism (Hardy, et al., 1994, J. Virol. 68: 7790-7799). There appearto be parallels between lytic infections by influenza A virus and HSV-1.Similarly as shown for the NS1 protein, the ICP27 protein has been foundto have pleiotropic regulatory effects. Roles for ICP27 in mRNA 3′-endprocessing (Brown, et al., 1995, J. Virol. 69: 7187-7195; McLauchlan, etal., 1992, J. Virol. 66: 6939-6945) and in mRNA export (Phelan, et al.,1997, J. Gen. Virol. 78: 3327-3331; Soliman, T. M., et al., 1997. J.Virol. 71: 9188-9197) were suggested in addition to an inhibitoiveffecton pre-mRNA splicing. Furthermore, the expression of ICP27 induces theredistriibution of SC35 and spliceosomal snRNPs from the known speckledpattern to few condensed intranuclear structures in which theycolocalize with the ICP27 protein (Phelan, et al., 1993, Proc. Natl.Acad. Sci. USA 90: 9056-9060; Sandri-Goldin, et al., 1995, J. Virol. 69:6063-6076). This pattern is basically the opposite to the situationobserved in influenza A virus-infected cells. In this example, thenumber of SC35 domains appears to increase with a concomitant decreasein size (Fortes, et al., 1995, J. Gen. Virol. 76: 1001-1007) (thisstudy) and in contrast to ICP27, the NS1 protein is localized throughoutthe nucleus in a diffuse pattern.

9. EXAMPLE A PROLINE-RICH MOTIF WITHIN THE MATRIX PROTEIN OFRHABDOVIRUSES INTERACTS WITH CELLULAR WW-DOMAINS AND FUNCTIONS INBUDDING

The matrix (M) protein of rhabdoviruses plays a key role in viralassembly and budding, however the precise mechanism by which M mediatesthese processes remains unclear. A highly conserved, proline-rich motif(PPxY or PY motif; where P-proline, Y-tyrosine and x-any amino acid) ofrhabdoviral M proteins was found to be associated with a functional rolein budding mediated by the M protein. Point mutations that disrupt thePY motif of the M protein of vesicular stomatitis virus (VSV) had noobvious effect on membrane localization of M, but instead lead to adecrease in the amount of M protein released from cells in a functionalbudding assay. Interestingly, the PPxY sequence within rhabdoviral Mproteins was identical to that of the ligand which interacts with WWdomains of cellular proteins. Indeed, far-western blotting demonstratedthat the PY motifs of both VSV (PPPY) and rabies virus (PPEY) M proteinscan interact specifically with WW domains of cellular proteins.Moreover, point mutations that disrupt the consensus PY motif of VSV orrabies virus M protein resulted in a significant decrease in theirability to bind to ellular WW domains. These properties of the PY motifof rhabdovirus M proteins are trikingly analogous to those of the late(L) budding domain identified in the gag-specific rotein p2b of Roussarcoma virus (RSV). The results of the following example indicate hatrhabdoviruses usurp host proteins to facilitate the budding process, andthat M-mediated budding of rhabdoviruses and gag-mediated budding ofretroviruses have features in common.

This example demonstrates that a highly conserved PPxY motif at theamino termini of several rhabdoviral M proteins is important for viralbudding. The PPxY motifs of both VSV and rabies virus M proteins wereshown to interact specifically with WW domains of cellular proteins,including YAP. Mutations that disrupted the viral PPxY motifs of VSV andrabies virus M proteins also disrupted-their ability to interact with WWdomains. Lastly, the PPxY motif of VSV M was shown to be important forthe release of M protein from cells in a functional budding assay. Takentogether, these results demonstrate that the M proteins of rhabdovirusespossess a proline-rich budding domain similar to the L domain found inthe gag proteins of retroviruses, and that the budding domain of M maymediate its function through interactions with selected host proteins.

9.1. MATERIALS AND METHODS

9.1.1. CELLS AND VIRUSES

Stocks of CV-1 and BHK-21 cells were maintained in Dulbecco's MinimalEssential Medium (DMEM; Life Technologies) supplemented with 10 fetalcalf serum (Hyclone). Vesicular stomatitis virus (Indiana serotype) waspropagated in BHK-21 cells.

9.1.2. PLASMIDS

The M gene of VSV (Indiana serotype) was cloned by RT-PCR using primersflanking the open reading frame and containing an EcoRV (5′) and XbaI(3′) restriction endonuclease sites. The PCR product was inserted intothe EcoR V/XbaI-digested pSP72 vector containing the bacteriophage T7promoter (Promega) using standard protocols (Ausubel, et al., 1992,Current Protocols in Molecular Biology, John Wiley and Sons, New York).Briefly, total RNA was isolated from BHK-21 cells infected with VSVusing the TRIzol reagent and protocol of the manufacturer (LifeTechnologies). Reverse-transcription was performed using AMV reversetranscriptase (Life Technologies), and PCR was performed using standardprotocols with Vent DNA polymerase (New England Biolabs). PCR fragmentsencoding amino acids 1-74 of VSV M, 1-202, 1-69, and 1-52 of rabies Mwere inserted into the BamHI and EcoRI restriction sites of the vectorpGEX-2TK (Pharmacia) for expression of gst fusion proteins.Oligonucleotide primers and standard PCR protocols were utilized tointroduce point mutations within the PPxY motifs of VSV and rabies virusgst-M fusion proteins. All plasmids and introduced mutations wereconfirmed by restriction endonuclease digestion and DNA sequencing bythe Sanger method (Sanger, et al., 1977, Proc. Natl. Acad. Sci. U.S.A.,74, 5463-5467.). Plasmid DNAs were maintained in either E. coli strainDH5alpha (Life Technologies) or strain SURE2 (Stratagene), and DNA waspurified using the Qiagen purification system (Qiagen Inc.).

9.1.3. PURIFICATION OF M PROTEIN FROM VSV VIRIONS

Briefly, the supernatant was harvested from VSV infected BHK-21 cells at36 hour post-infectionand clarified first at 2500 rpm for 10 min., andthen at 3200 rpm for 10 min. The supernatant was then centrifuged at36,000 rpm for 30 min. in an SW41 rotor. The virion pellet was thensuspended in 400 μl of buffer containing 10 mM Tris pH=8.0, 0.25 M NaCl,1.0 Triton-X100, and 0.2 mg/ml DTT and incubated at room temperature for30 min. The sample was then centrifuged at 75,000 rpm for 2 hours in aTL-100 ultracentrifuge (Beckman). The supernatant fraction (S) wasremoved and stored at −70° C., while the pellet fraction (P) wassuspended in 400 μl of the above buffer and then stored at −70° C.

9.1.4. PURIFICATION AND RADIOLABELING OF GST FUSION PROTEINS

All gst fusion proteins were expressed from the plasmid pGEX-2TK in E.coli SURE2 cells using the gst Gene Fusion System and the protocols ofthe manufacturer (Pharmacia). The labeling of the fusion proteins with³²P-gamma ATP (6000Ci/mmol; NEN Dupont) and far-western blotting were asdescribed previously (Kaelin, et al., 1992, Cell, 70, 351-364, Chen andSudol, 1995, Proc. Natl. Acad. Sci. USA, 92, 7819-7823).

9.1.5. BUDDING ASSAY

The budding assay was essentially performed as described in Justice, etal., 1995, J. Virol., 69, 3156-3160. Briefly, 35 mm dishes of CV-1 cellswere infected with VvT7 (generously provided by B. Moss, NationalInstitutes of Health), and then transfected with the appropriate plasmidusing the DOTAP reagent (Boehringer Mannheim Corporation). At 2 hourspost-transfection the cells were metabolically labeled with 150 μCi of⁵SMet-Cys (NEN Dupont), and the cells and media were harvested atvarious times post-transfection. Cells were lysed in RIPA buffer (50 mMTris pH=8.0, 150 mM NaCl, 1.0 NP-40, 0.5 deoxycholate, 0.1 SDS), while900 μl of media was added to 100 μl of 10× NTE buffer (0.5 M Tris-HClpH=7.5, 1.5M NaCl, 1.0 NP-40, 10 mM EDTA, 2.5 gelatin, and 0.2M sodiumazide). Immunoprecipitation of both cells and media were performed usingpolyclonal antiserum directed against VSV M. Protein samples werefractionated by SDS-PAGE and visualized by autoradiography.

9.1.6. INDIRECT IMMUNOFLUORESCENCE

Sub-cellular localization of the VSV M protein was accomplished byindirect immunofluorescence. CV-1 cells expressing the M protein of VSVwere fixed and permeabilized for 15 minutes in 2.5 formaldehyde/0.5Triton X-100/PBS. The primary antibody was polyclonal anti-VSV M, whilethe secondary antibody was affinity purified goat-anti-rabbit conjugatedto FITC (Boehringer Mannheim Corporation). Positive cells werevisualized with the use of a Leica CLSM confocal microscope.

9.2. RESULTS

9.2.1. CONSERVATION OF THE PY MOTIF IN THE M PROTEIN OF RHABDOVIRUSES

A globular domain that mediates protein-protein interactions wasidentified recently and shown to be present in a wide range of cellularproteins involved in signal transduction, gene regulation, andcytoskeletal formation (Sudol, 1996, In Blundell, et al., (eds.), Prog.Biophys. Molec. Biol, Vol. 65, Elsevier Science Ltd., Great Britain, pp.113-132). This domain, termed WW domain, is about 38-40 amino acids longand contains a number of conserved amino acids including two highlyconserved tryptophans spaced 20-22 amino acids apart (Sudol, 1996, InBlundell, et al., (eds.), Prog. Biophys. Molec. Biol., Vol. 65, ElsevierScience Ltd., Great Britain, pp. 113-132). The WW domain was shown tointeract with a polyproline ligand having the core consensus sequencePPxY (Chen, et al., 1997, J. Biol. Chem., 272, 17070-17077; Einbond andSudol, 1996, FEBS Lett., 384, 1-8). Of interest to us, is that this PPxYmotif is highly conserved in the M proteins of various rhabdoviruses(Table III, below; Gill and Banerjee, 1986, Virology, 150, 308-312;Kiuchi and Roy, 1984, Virology, 134, 238-243; Rayssiguier, et al., 1986,Virus Res., 5, 177-190; Rose and Gallione, 1981, J. Virol., 39,519-528). Not only is the primary sequence conserved, but also therelative location within the N-termini of these M proteins is maintained(Table III, below). In addition to the rhabdoviruses, the putativematrix proteins (VP4O) of both Ebola and Marburg viruses (filovirusesthat were initially classified as rhabdoviruses) also contain the PPxYmotif at their amino termini (Table III, below; Bukreyev, et al., 1995,Arch. Virol., 140, 1589-1600; Sanchez, et al., 1993, Virus Res., 29,215-240). The highly conserved nature of the sequence and topology ofthe PPxY motif within these viral structural proteins implies animportance perhaps in the structure and/or function of these proteins.

9.2.2. VSV M PROTEIN INTERACTS WITH CELLULAR WW DOMAINS IN VITRO

Since the highly conserved PY motif present within rhabdoviral Mproteins is identical to the sequence of the ligand which interacts withWW domains, it was determined whether the M protein of VSV couldinteract with WW domains of cellular proteins in a far-western blottingassay. To isolate the M protein from VSV virion, BHK-21 cells wereinfected with VSV, and progeny virions were purified from thesupernatant as described (see Section 9.1 Materials and Methods). Thevirion preparation was divided into pellet (P) and soluble (S)fractions, and the viral proteins present within each fraction wereanalyzed by SDS-PAGE and visualized by staining with coomassie brilliantblue (FIG. 24A). As expected, the soluble fraction containedpredominantly the two viral envelope-associated proteins G(glycoprotein) and M, while the pellet fraction contained the viralnucleocapsid (N) protein in addition to G and M (FIG. 24A). In additionto the soluble fraction of purified virions (VR), cell extracts frommock-infected (m) or VSV infected (V) BHK-21 cells were probed with³²P-labeled glutathione-S-transferase alone (gst), or a fusion proteinconsisting of gst and WW domain 1 (gstYAPWW1) from the mouse YAP (FIG.24B). Following a 12 hour exposure of the film, the M protein (atapproximately 30 kD molecular mass) from purified virions interactedstrongly with gstYAPWW1, but not with gst alone (FIG. 24B, lanes VR). Alonger exposure of the filter (FIG. 24C) demonstrated that the VSV Mprotein from infected cell extracts also interacted with the gstYAPWW1probe (FIG. 24C, lane V), but not with gst alone (data not shown). Aprotein of 30 kD was not detected in the mock-infected cell extract(FIG. 24C, lane m). Instead, a protein of approximately 38 kD wasobserved in the mock-infected cell extract probed with gstYAPWW1 (FIG.24C, lane m). The presence of this cellular protein was not unexpected,since it has been described previously as WBP-2; a cellular protein ofunknown function that interacts with the WW domain of YAP (Chen andSudol, 1995, Proc. Natl. Acad. Sci. USA, 92, 7819-7823). Interestingly,WBP-2 was not observed in the VSV-infected cell extract (FIG. 24C, laneV). WBP-2 was also not detected when probed with gst alone (data notshown). These data indicate that full-length M protein from VSV virionsand VSV-infected cell extracts can interact with the WW domain of acellular protein.

9.2.3. THE N-TERMINUS OF VSV M IS SUFFICIENT FOR INTERACTING WITHCELLULAR WW DOMAINS

To determine whether the N-tenninus of VSV M containing the PPxY motifwas sufficient to mediate the interaction with the WW domains of YAP,the full-length M gene of VSV was first cloned by RT-PCR. Severalplasmids were then constructed to express the N-terminal 74 amino acidsof VSV M fused to the gst moiety (FIG. 25). The PPxY motif, which beginsat amino acid position 24, was unmodified in plasmid gstVSVM74WT (FIG.25). However, in plasmids gstVSVM74P-A and gstVSVM74Y-A proline (P) 24was changed to alanine (A) and tyrosine (Y) 27 was changed to (A),respectively (FIG. 25). The gstVSVM74WT protein was expressed in E. coligrown under inducing (IN) conditions (FIG. 26A). Equivalent amounts ofinduced or uninduced bacterial extracts were immobilized ontonitrocellulose filters and probed with gstYAPWW2 (FIG. 26B, lanes 2 and3), gstYAPWW1 (lanes 4 and 5), or gst alone (lanes 6 and 7). ThegstVSVM74WT fusion protein interacted with both WW domains 1 and 2 fromthe mouse YAP, however a reproducibly stronger interaction was observedwith WW domain 1 (FIG. 26B, compare lanes 2 and 4). A similar preferencefor WW domain 1 was also observed when full-length M protein frompurified virions and VSV-infected cells extracts were assayed byfar-western analysis. The gstVSVM74WT fusion protein did not interactwith gst alone (FIG. 26B, lane 6).

To further confirm that the PY motif present within the N-terminal 74amino acids of VSV M was responsible for this interaction, the fusionproteins containing point mutations within the PY motif were used in asimilar far-western blotting assay. Mutations of the first P or Y inPPxY to alanine have been shown to result in a decrease in theefficiency of binding to WW domains (Chen and Sudol, 1995, Proc. Natl.Acad. Sci. USA, 92, 7819-7823; Chen, et al., 1997, J. Biol. Chem., 272,17070-17077). The wild type and mutant gst fusion proteins wereexpressed to equivalent levels in E. coli (FIG. 27A) and probed withgstYAPWW2 (FIG. 27B). As expected, WW domain 2 of YAP interacted withthe gstVSVM74WT protein (FIG. 27B, lane 1), however the ability of WWdomain 2 of YAP to interact with either of the point mutants was reducedby 90 (FIG. 27B, lanes 2 and 3). The gstYAPWW2 probe did not interactwith gst alone, and the gst probe did not interact with the wild type ormutant gstVSVM74 fusion proteins (data not shown). These datademonstrate that the N-terminal 74 amino acids of VSV M are sufficientfor interacting with WW domains of YAP, and that the viral PPxY motifcan serve as the core ligand for this cellular protein domain.

9.2.4. RABIES VIRUS M PROTEIN INTERACTS WITH CELLULAR WW DOMAINS INVITRO

To determine whether a second rhabdoviral M protein (with limitedoverall sequence identity to M protein of VSV) could interact with WWdomains, the full-length rabies virus M protein or various C-terminaltruncations of rabies M were fused to gst (FIG. 28). The fusion proteingstRabM52Y-A is identical to gstRabM52WT except for a single pointmutation in the PY motif of the rabies M protein which changes thetyrosine to an alanine (FIG. 28). All four gstRabM fusion proteins wereexpressed in E. coli grown under inducing conditions and used infar-western blotting assays (FIG. 29). Duplicate filters containinggstRabM202 and gstRabM69 were probed with either gst alone, or gstYAPWW2(FIG. 29A). Both gstRabM202 and gstRabM69 fusion proteins interactedwith the gstYAPWW2 probe (FIG. 29A, lanes 3 and 4), but not with gstalone (FIG. 29A, lanes 1 and 2). Identical amounts of gstRabM52WT andgstRabM52Y-A fusion proteins (as shown in FIG. 29B) were also probedwith either gst alone, or gstYAPWW2 (FIG. 29C). The gstRabM52WT fusionprotein interacted with gstYAPWW2 (FIG. 29C, lane 4), but not with gstalone (lane 2). In contrast, a single point mutation within the PPxYmotif in protein gstRabM52Y-A completely abolished the interaction withthe YAP WW domain (FIG. 29C, lane 3).

To examine further the rabies M protein and WW domain interaction, therabies fusion proteins were now purified, quantitated, and labeled asprobes against a panel of gstWW domain fusion proteins (FIG. 30). Inaddition to gst alone, equivalent amounts (as shown in FIG. 30C) ofgstYAPWW1, gstYAP WW2, gstDystrophinWW, gstNedd4WW2, gstNedd4WW3, andgstFE65WW fusion proteins were probed with either gstRabM52WT (FIG.30A), or gstRabM52Y-A (FIG. 30B). As expected, gstRabM52WT interactedwith WW domain 2 and WW domain 1 (upon longer exposure of the filter) ofYAP (FIG. 30A). Interestingly, gstRabM52WT also interacted strongly withWW domain 2 from the Nedd4 protein (FIG. 30A). The gstRabM52WT proteindid not interact with the remaining gstWW domains fusion proteinsdemonstrating that there is specificity in this protein-proteininteraction. Once again, the interactions between the rabies M proteinand the various WW domains observed (FIG. 30A) were completely abolishedby the introduction of a single point mutation in the PPxY motif in thegstRab52MY-A protein (FIG. 30B).

9.2.5. THE PY MOTIF OF VSV M FACILITATES BUDDING

A functional budding assay for VSV M has been described previouslyJustice, et al., 1995, J. Virol., 69, 3156-3160. This assay was utilizedto determine whether the PPxY motif of VSV M protein is important inM-mediated budding. CV-1 cells were first infected with the recombinantvaccinia Vifus (VvT7) expressing the bacteriophage T7 polymerase, andthen transfected with plasmid pT7VSVMWT (expressing fill-length, wildtype M protein), plasmid pT7VSVMY-A (identical to wild type M except fora single point mutation within the PY motif changing tyrosine toalanine), or no DNA (mock-transfected). Both the cells and media wereharvested and subjected to immunoprecipitation using polyclonal anti-VSVM antiserum (FIG. 31). Identical amounts of both wild type and mutantVSV M proteins were observed in the cell lysates (FIG. 31A, lanes 3 and4), while no M protein was detected in niock-transfected cells (FIG.31A, lane 2). In contrast, the amount of the mutant M protein (FIG. 31B,lane 3) released into the media by budding was reduced reproducibly byabout 70% as compared to the amount of wild type VSV M protein in themedia (FIG. 31B, lane 2). Thus, a single point mutation within the PPxYmotif of VSV M resulted in a significant decrease in budding efficiency.

To determine whether the defect in budding of the mutant M protein wasdue to an inability to localize to the cell membrane, indirectimmunofluorescence and confocal microscopy were performed on cellstransfected with either the wild type or mutant M protein (FIGS.32A-32C). Transfected cells were examined at 5, 8, and 10 hourspost-transfection (data not shown for 5 and 10 hour time points). Forall time points tested, both the wild type and mutant M proteinslocalized to the cytoplasmic membrane equally well (FIGS. 32A-32C).These data indicate that the defect in budding of the mutant M proteinis not simply due to mislocalization within the cell, but rather to adefect in a later stage of the budding process.

9.3. DISCUSSION

Much progress has been made in studying the assembly and buddingpathways of negative-sense RNA viruses in general, however manyquestions remain concerning the role of both viral and host proteins inthese late stages of the viral life-cycle. With respect to rhabdoviralassembly and egress, the results described in Section 9.2, above,demonstrates that: (i) A highly conserved PPxY motif within the Mprotein of VSV and rabies virus can function as a ligand that interactsin vitro with WW domains of cellular proteins, and (ii) This same PPxYmotif of VSV M protein is functionally important for the buddingprocess. Taken together, these findings indicate that rhabdoviralbudding mediated by the M protein is facilitated by viral-hostinteractions.

The polyproline ligand that interacts with the WW domain has beenidentified and well characterized as having the core consensus sequencePPxY. While the M protein of many rhabdoviruses maintain the PPxY motifat their amino-termini (Table III), it should be noted that the Mproteins from several rhabdoviruses of fish possess a PPxH (H-histidine)motif rather than PPxY (Benmansour, et al., 1994, Virology, 198,602-612). Although the aromatic nature of the amino acid positionoccupied by H rather than Y is maintained, this PPxH motif may alsointeract with either WW domains, or perhaps a WW-like domain. WW domainsfrom YAP and the Nedd4 protein interacted strongly and specifically withthe PPxY motifs of both VSV and rabies virus M proteins, whereas WWdomains from other cellular proteins (dystrophin, FE65, and ESS 1) didnot interact with either viral protein (FIG. 30 and data not shown).Single point mutations within the PPxY motifs of VSV M protein andrabies virus M protein were sufficient to either significantly reduce,or abolish interactions with cellular WW domains. Both the WW domains ofYAP and of the Nedd4 protein were able to interact with the viral Mproteins, and both YAP and Nedd4 can be found on the inner side of theplasma membrane, indicating that these cellular proteins representauthentic, in vivo targets for interacting with the rhabdoviral Mprotein.

Both the WW domain and the related SH3 domain have been implicated inmediating virus-host protein-protein interactions. The Nef protein ofHIV-1 for example possesses a polyproline ligand which has been shown tointeract with Src-family SH3 domains of cellular proteins and to beimportant for optimal viral replication (Lee, et al., 1995, EMBO J., 14,5006-5015; Lee, et al., 1996, Cell, 85, 931-942). The LMP2 protein ofEpstein-Barr virus (EBV) contains two polyproline motifs that have beenpostulated to mediate an interaction between LMP2 and the Src-familytyrqsine kinases, FYN and LYN (Longnecker, et al., 1991, J. Virol., 65,3681-3692). Lastly, of particular interest to us, the PPxY motif ishighly conserved in the gag proteins of many animal and humanretroviruses (Wills, et al., 1994, J. Virol., 68, 6605-6618). One of thebetter characterized gag proteins in terms of functional domainsimportant for gag-mediated budding, is that of Rous sarcoma virus(Bennett, et al., 1991, J. Virol., 65, 272-280; Bennett, et al., 1993,J. Virol., 67, 6487-6498; Craven, et al., 1993, J. Virol., 67,6246-6252; Weldon, et al., 1990, J. Virol., 64, 4169-4179; Weldon, etal., 1993, J. Virol., 67, 5550-5561; Wills and Craven, 1991, AIDS, 5,639-654; Wills, et al., 1991, J. Virol., 65, 3804-3812; Wills, et al.,1994, J. Virol., 68, 6605-6618). Elegant studies have demonstrated thatthe PPxY motif present within the p2b protein of RSV gag not onlyinteracts with WW domains in vitro, but also functions as a late buddingdomain (L domain) which is essential for a late stage in retroviralassembly and release (Gamier, et al., 1996, Nature, 381, 744-745; Wills,et al., 1994, J. Virol., 68, 6605-6618). RSV gag proteins havingmutations in the PPxY motif, or those deleted in this motif aredefective in budding. Late budding domains have been identified in gagproteins of human immunodeficiency virus HIVI-1 and equine infectiousanemia virus, and have been implicated in mediating interactions withhost proteins (Gottlinger, et al., 1991, Proc. Nati. Acad. Sci. USA.,88, 3195-3199; Huang, et al., 1995, J. Virol., 69, 6810-6818; Puffer, etal., 1997, J. Virol., 71, 6541-6546). For example, Tyr Xaa Xaa Leu isapparently the “budding motif” in the gag p⁹ protein of the equineinfectious anemia virus. (Puffer, et al., 1997, J. Virol., 71,6541-6546). A similar motif in the influenza virus M1 protein is Tyr XaaXaa Leu (aa 100-103).

Thus, for the PPxY-containing gag proteins, it has been postulated thatcellular proteins containing WW domains may play a role in thematuration and budding of these retroviruses (Gamier, et al., 1996,Nature, 381, 744-745). Our findings with the M protein of rhabdovirusesparallel those of RSV gag in that the PPxY motif of VSV M can alsointeract with WW domains and appears to be important in the buddingprocess (FIGS. 31 and 32A-32C). These data provide further evidence thatthe M protein of rhabdoviruses and the gag protein of retroviruses arein many ways functionally equivalent. Indeed, the N-terminal 74 aminoacids of VSV M protein containing the PPxY motif are capable offunctionally replacing the L-domain of the p2b protein of RSV in afunctional retroviral budding assay. Moreover, point mutations thataltered the PPxY motif of VSV M in these chimeric M-gag proteinsresulted in a protein that was defective in budding. Results from theinitial immunofluorescence analyses (FIGS. 32A-32C) lend support to theidea that the PPxY motif of VSV M protein is not required for membranelocalization, but rather appears to be required for a later step in thebudding pathway. Thus, the PPxY motif of the VSV M protein likelyfunctions as a rhabdoviral late budding domain.

Should the PPxY-WW domain interaction prove to be a crucial interactionbetween the virus and host in vivo, then this virus-host interactioncould serve as a potential target for antiviral agents designed todisrupt or block this step of viral assembly and release. Since the WWdomain and the core motif of its ligand are relatively short, one couldspeculate that such antiviral agents could be easily selected fromchemical libraries of low molecular weight compounds (Sudol, 1997,Emerging therapeutic targets, Vol. 1, pp. 81-84). Also, if indeed thePPxY-WW domain interaction represents a common step in the assemblypathways of rhabdoviruses, retroviruses, and filoviruses, it is temptingto speculate that antivirals which target this interaction may beeffective against a variety of viral pathogens including Ebola andMarburg viruses.

TABLE III

¹The sequence is derived from the matrix protein of VSV, rabies, Piry,and S.V. of C. viruses The sequence is derived from the VP40 protein ofEbola and Marburg viruses. ²The numbers represent amino acid positionsof the PY motif within the protein beginning from the N-terminus. ³Thissequence (14 amino acids) of the rabies virus matrix protein isperfectly conserved in strains SAD B19,CVS, Nishigahara, ERA, and PV.⁴Spring Viremia of Carp virus. ⁵Ebola virus (Zaire strain). ⁶Marburgvirus (Popp strain). ⁷GenBank Accession number.

10. EXAMPLE INHIBITION OF GROWTH OF INFLUENZA A/WSN/VIRUS BY THENP-SPECIFIC NLS OLIGOPEPTIDE

The yeast 2-hybrid assay has been used previously to identify thespecific domains on the NP that bind to the NPI protein (Wang, et al.,1997, J. Virology 71(3): 1850-1856). Mutational analysis includingalanine scanning identified the motifs Ser Xaa Gly Thr Lys Arg Ser TyrXaa Xaa Met (SEQ ID NO:40) and Thr Lys Arg Ser Xaa Xaa Xaa Met (SEQ IDNO:41). which are required for binding of NP to NPI-1 and NPI-3,respectively. These sequences were shown to possess nuclear localizationsignal (NLS) activity. In the present example, MDBK cells were infectedat an m.o.i.=1 in the presence or absence of a 19-amino acid NP-NLSoligopeptide (Ala Ser Gln Gly Thr Lys Arg Ser Tyr Glu Gln Met Glu ThrAsp Gly Glu Arg Gln (SEQ ID NO:22); corresponding to amino acids 2-20 ofNP) and an 11-amino-acid control peptide (Cys Gly Asp Leu Arg Leu ThrLeu Leu Glu Leu (SEQ ID NO:42)). Hemagglutination titer was measured at24 hours p.i. At a concentration of 100 μm, the NP-NLS oligopeptideinhibits viral growth a thousand-fold.

The ability of different fragments of the NPI-1 protein to bind theNP-NLS peptide were analyzed. The region of NPI-1 primarily responsiblefor the binding of NPI-1 to the NP-NLS lies between aa 425-538 at thecarboxy terminus of NPI-1, in the region of the STAT-1 binding domain.

These results demonstrate that peptides that correspond to the NP-NLStarget site of the NPI-1 binding domain of NP can inhibit theinteraction of NP with NPI proteins and can therefore be used to treatinfluenza infection.

The present invention is not to be limited in scope by the specificembodiments described which are intended as single illustrations ofindividual aspects of the invention, and functionally equivalent methodsand components are within the scope of the invention. Indeed, variousmodifications of the invention, in addition to those shown and describedherein will become apparent to those skilled in the art from theforegoing description and accompanying drawings. Such modifications areintended to fall within the scope of the appended claims.

42 1 27 DNA Artificial Sequence pLexA-NP bait plasmid 1 gactggctggaattccccat ggcgtcc 27 2 9 PRT Artificial Sequence pLexA-NP bait plasmid2 Asp Trp Leu Glu Phe Pro Met Ala Ser 1 5 3 2940 DNA Homo sapiens CDS(47)...(1660) 3 ctaacttcag cggtggcacc gggatcggtt gccttgagcc tgaaat atgacc acc 55 Met Thr Thr 1 cca gga aaa gag aac ttt cgc ctg aaa agt tac aagaac aaa tct ctg 103 Pro Gly Lys Glu Asn Phe Arg Leu Lys Ser Tyr Lys AsnLys Ser Leu 5 10 15 aat ccc gat gag atg cgc agg agg agg gag gaa gaa ggactg cag tta 151 Asn Pro Asp Glu Met Arg Arg Arg Arg Glu Glu Glu Gly LeuGln Leu 20 25 30 35 cga aag cag aaa aga gaa gag cag tta ttc aag cgg agaaat gtt gct 199 Arg Lys Gln Lys Arg Glu Glu Gln Leu Phe Lys Arg Arg AsnVal Ala 40 45 50 aca gca gaa gaa gaa aca gaa gaa gaa gtt atg tca gat ggaggc ttt 247 Thr Ala Glu Glu Glu Thr Glu Glu Glu Val Met Ser Asp Gly GlyPhe 55 60 65 cat gag gct cag att agt aac atg gag atg gca cca ggt ggt gtcatc 295 His Glu Ala Gln Ile Ser Asn Met Glu Met Ala Pro Gly Gly Val Ile70 75 80 act tct gac atg att gag atg ata ttt tcc aaa agc cca gag caa cag343 Thr Ser Asp Met Ile Glu Met Ile Phe Ser Lys Ser Pro Glu Gln Gln 8590 95 ctt tca gca aca cag aaa ttc agg aag ctg ctt tca aaa gaa cct aac391 Leu Ser Ala Thr Gln Lys Phe Arg Lys Leu Leu Ser Lys Glu Pro Asn 100105 110 115 cct cct att gat gaa gtt atc agc aca cca gga gta gtg gcc aggttt 439 Pro Pro Ile Asp Glu Val Ile Ser Thr Pro Gly Val Val Ala Arg Phe120 125 130 gtg gag ttc ctc aaa cga aaa gag aat tgt tca ctg cag ttt gaatca 487 Val Glu Phe Leu Lys Arg Lys Glu Asn Cys Ser Leu Gln Phe Glu Ser135 140 145 gct tgg gta ctg aca aat att gct tca gga aat tct ctt cag acccga 535 Ala Trp Val Leu Thr Asn Ile Ala Ser Gly Asn Ser Leu Gln Thr Arg150 155 160 att gtg att cag gca aga gct gtg ccc atc ttc ata gag ttg ctcagc 583 Ile Val Ile Gln Ala Arg Ala Val Pro Ile Phe Ile Glu Leu Leu Ser165 170 175 tca gag ttt gaa gat gtc cag gaa cag gca gtc tgg gct ctt ggcaac 631 Ser Glu Phe Glu Asp Val Gln Glu Gln Ala Val Trp Ala Leu Gly Asn180 185 190 195 att gct gga gat agt acc atg tgc agg gac tat gtc tta gactgc aat 679 Ile Ala Gly Asp Ser Thr Met Cys Arg Asp Tyr Val Leu Asp CysAsn 200 205 210 atc ctt ccc cct ctt ttg cag tta ttt tca aag caa aac cgcctg acc 727 Ile Leu Pro Pro Leu Leu Gln Leu Phe Ser Lys Gln Asn Arg LeuThr 215 220 225 atg acc cgg aat gca gta tgg gct ttg tct aat ctc tgt agaggg aaa 775 Met Thr Arg Asn Ala Val Trp Ala Leu Ser Asn Leu Cys Arg GlyLys 230 235 240 agt cca cct cca gaa ttt gca aag gtt tct cca tgt ctg aatgtg ctt 823 Ser Pro Pro Pro Glu Phe Ala Lys Val Ser Pro Cys Leu Asn ValLeu 245 250 255 tcc tgg ttg ctg ttt gtc agt gac act gat gta ctg gct gatgcc tgc 871 Ser Trp Leu Leu Phe Val Ser Asp Thr Asp Val Leu Ala Asp AlaCys 260 265 270 275 tgg gcc ctc tca tat cta tca gat gga ccc aat gat aaaatt caa gcg 919 Trp Ala Leu Ser Tyr Leu Ser Asp Gly Pro Asn Asp Lys IleGln Ala 280 285 290 gtc atc gat gcg gga gta tgt agg aga ctt gtg gaa ctgctg atg cat 967 Val Ile Asp Ala Gly Val Cys Arg Arg Leu Val Glu Leu LeuMet His 295 300 305 aat gat tat aaa gtg gtt tct cct gct ttg cga gct gtggga aac att 1015 Asn Asp Tyr Lys Val Val Ser Pro Ala Leu Arg Ala Val GlyAsn Ile 310 315 320 gtc aca ggg gat gat att cag aca cag gta att ctg aattgc tca gct 1063 Val Thr Gly Asp Asp Ile Gln Thr Gln Val Ile Leu Asn CysSer Ala 325 330 335 ctg cag agt tta ttg cat ttg ctg agt agc cca aag gaatct atc aaa 1111 Leu Gln Ser Leu Leu His Leu Leu Ser Ser Pro Lys Glu SerIle Lys 340 345 350 355 aag gaa gca tgt tgg acg ata tct aat att aca gctgga aat agg gca 1159 Lys Glu Ala Cys Trp Thr Ile Ser Asn Ile Thr Ala GlyAsn Arg Ala 360 365 370 cag atc cag act gtg ata gat gcc aac att ttc ccagcc ctc att agt 1207 Gln Ile Gln Thr Val Ile Asp Ala Asn Ile Phe Pro AlaLeu Ile Ser 375 380 385 att tta caa act gct gaa ttt cgg aca aga aaa gaagca gct tgg gcc 1255 Ile Leu Gln Thr Ala Glu Phe Arg Thr Arg Lys Glu AlaAla Trp Ala 390 395 400 atc aca aat gca act tct gga gga tca gct gaa cagatc aag tac cta 1303 Ile Thr Asn Ala Thr Ser Gly Gly Ser Ala Glu Gln IleLys Tyr Leu 405 410 415 gta gaa ctg ggt tgt atc aag ccg ctc tgt gat ctcctc acg gtc atg 1351 Val Glu Leu Gly Cys Ile Lys Pro Leu Cys Asp Leu LeuThr Val Met 420 425 430 435 gac tct aag att gta cag gtt gcc cta aat ggcttg gaa aat atc ctg 1399 Asp Ser Lys Ile Val Gln Val Ala Leu Asn Gly LeuGlu Asn Ile Leu 440 445 450 agg ctt gga gaa cag gaa gcc aaa agg aac ggcact ggc att aac cct 1447 Arg Leu Gly Glu Gln Glu Ala Lys Arg Asn Gly ThrGly Ile Asn Pro 455 460 465 tac tgt gct ttg att gaa gaa gct tat ggt ctggat aaa att gag ttc 1495 Tyr Cys Ala Leu Ile Glu Glu Ala Tyr Gly Leu AspLys Ile Glu Phe 470 475 480 tta cag agt cat gaa aac cag gag atc tac caaaag gcc ttt gat ctt 1543 Leu Gln Ser His Glu Asn Gln Glu Ile Tyr Gln LysAla Phe Asp Leu 485 490 495 att gag cat tac ttc ggg acc gaa gat gaa gacagc agc att gca ccc 1591 Ile Glu His Tyr Phe Gly Thr Glu Asp Glu Asp SerSer Ile Ala Pro 500 505 510 515 cag gtt gac ctt aac cag cag cag tac atcttc caa cag tgt gag gct 1639 Gln Val Asp Leu Asn Gln Gln Gln Tyr Ile PheGln Gln Cys Glu Ala 520 525 530 cct atg gaa ggt ttc cag ctt tgaagcaatactctgctttc acgtacctgt 1690 Pro Met Glu Gly Phe Gln Leu 535 gctcagaccaggctacccag tcgagtcctc ttgtggagcc cacagtcctc atggagctaa 1750 cttctcaaatgttttccata atactgtttg cgctcatttg cttgccttgc gcacctgctc 1810 tcttacacacatctggaaaa cctccggctc tctgtggtgg gatacccttc taataaaagg 1870 gtaaccagaacggcccactc tcttttacgg aaaaatccct aggctttgga gatccgcact 1930 tacattagagttatgggaat atacacatat taatgtggct ccctttttct tgtgggggaa 1990 taaaagaggactcctcctca ttccctttaa catgggggaa aaaactgaca ttaaaagatg 2050 agactaaatctttatcttga attttacaca actacttacg acaagggaga tgtttagacc 2110 tgttggtatacttcagagta cttttcatga gttcttccac agtgaaccct tggattacct 2170 ggtggctttttctagccaga ttgcattaat ccttactgag attggatggt tttctttcct 2230 ctattggcgccattcttcag atattaaagt taaaccatcc actccctcac cttcagcctt 2290 cagtgaatgtgctttctagt tgtcaggaat gctgaagaat taacactttg actcctaaat 2350 gtgatactggtgggtaagag cagggcacat ttaatttgtt cgcttttgct tctctttggt 2410 ctgggcacatttaatttgtt cgcttttgct tctctttggt cttttcgaat acttagtaat 2470 cgaaaaccatatcctgtaat ttaataaaaa aaactaagga cgaaaaaacc cctccaattt 2530 tcccaaatgcaatcagtgta actaggggct gtgtttctgc attaaaataa atgtttcagg 2590 ctttgtggtcctgatcaagg tcctcattaa aaaattggag ttcaccctag gcttttcccc 2650 tctgtgactggcagataaca catacttttg aaagtaactt tgggattttt tttcttaggt 2710 gcagctcgattctaatcttt tcatgctgca cacgattcct ttaatcgata gcatccttat 2770 ctgaaagaaataaccatctt ctcaacatga cctgcttaac ccaaataaga acagtgatct 2830 tataacctcattgtttccta atctatttta tttcatctcc tgctagtact gtgccgcttc 2890 cccctccccccacacaaaat aaaaacagta tctcgcttct ggctcatttt 2940 4 538 PRT Homo sapiens4 Met Thr Thr Pro Gly Lys Glu Asn Phe Arg Leu Lys Ser Tyr Lys Asn 1 5 1015 Lys Ser Leu Asn Pro Asp Glu Met Arg Arg Arg Arg Glu Glu Glu Gly 20 2530 Leu Gln Leu Arg Lys Gln Lys Arg Glu Glu Gln Leu Phe Lys Arg Arg 35 4045 Asn Val Ala Thr Ala Glu Glu Glu Thr Glu Glu Glu Val Met Ser Asp 50 5560 Gly Gly Phe His Glu Ala Gln Ile Ser Asn Met Glu Met Ala Pro Gly 65 7075 80 Gly Val Ile Thr Ser Asp Met Ile Glu Met Ile Phe Ser Lys Ser Pro 8590 95 Glu Gln Gln Leu Ser Ala Thr Gln Lys Phe Arg Lys Leu Leu Ser Lys100 105 110 Glu Pro Asn Pro Pro Ile Asp Glu Val Ile Ser Thr Pro Gly ValVal 115 120 125 Ala Arg Phe Val Glu Phe Leu Lys Arg Lys Glu Asn Cys SerLeu Gln 130 135 140 Phe Glu Ser Ala Trp Val Leu Thr Asn Ile Ala Ser GlyAsn Ser Leu 145 150 155 160 Gln Thr Arg Ile Val Ile Gln Ala Arg Ala ValPro Ile Phe Ile Glu 165 170 175 Leu Leu Ser Ser Glu Phe Glu Asp Val GlnGlu Gln Ala Val Trp Ala 180 185 190 Leu Gly Asn Ile Ala Gly Asp Ser ThrMet Cys Arg Asp Tyr Val Leu 195 200 205 Asp Cys Asn Ile Leu Pro Pro LeuLeu Gln Leu Phe Ser Lys Gln Asn 210 215 220 Arg Leu Thr Met Thr Arg AsnAla Val Trp Ala Leu Ser Asn Leu Cys 225 230 235 240 Arg Gly Lys Ser ProPro Pro Glu Phe Ala Lys Val Ser Pro Cys Leu 245 250 255 Asn Val Leu SerTrp Leu Leu Phe Val Ser Asp Thr Asp Val Leu Ala 260 265 270 Asp Ala CysTrp Ala Leu Ser Tyr Leu Ser Asp Gly Pro Asn Asp Lys 275 280 285 Ile GlnAla Val Ile Asp Ala Gly Val Cys Arg Arg Leu Val Glu Leu 290 295 300 LeuMet His Asn Asp Tyr Lys Val Val Ser Pro Ala Leu Arg Ala Val 305 310 315320 Gly Asn Ile Val Thr Gly Asp Asp Ile Gln Thr Gln Val Ile Leu Asn 325330 335 Cys Ser Ala Leu Gln Ser Leu Leu His Leu Leu Ser Ser Pro Lys Glu340 345 350 Ser Ile Lys Lys Glu Ala Cys Trp Thr Ile Ser Asn Ile Thr AlaGly 355 360 365 Asn Arg Ala Gln Ile Gln Thr Val Ile Asp Ala Asn Ile PhePro Ala 370 375 380 Leu Ile Ser Ile Leu Gln Thr Ala Glu Phe Arg Thr ArgLys Glu Ala 385 390 395 400 Ala Trp Ala Ile Thr Asn Ala Thr Ser Gly GlySer Ala Glu Gln Ile 405 410 415 Lys Tyr Leu Val Glu Leu Gly Cys Ile LysPro Leu Cys Asp Leu Leu 420 425 430 Thr Val Met Asp Ser Lys Ile Val GlnVal Ala Leu Asn Gly Leu Glu 435 440 445 Asn Ile Leu Arg Leu Gly Glu GlnGlu Ala Lys Arg Asn Gly Thr Gly 450 455 460 Ile Asn Pro Tyr Cys Ala LeuIle Glu Glu Ala Tyr Gly Leu Asp Lys 465 470 475 480 Ile Glu Phe Leu GlnSer His Glu Asn Gln Glu Ile Tyr Gln Lys Ala 485 490 495 Phe Asp Leu IleGlu His Tyr Phe Gly Thr Glu Asp Glu Asp Ser Ser 500 505 510 Ile Ala ProGln Val Asp Leu Asn Gln Gln Gln Tyr Ile Phe Gln Gln 515 520 525 Cys GluAla Pro Met Glu Gly Phe Gln Leu 530 535 5 542 PRT Saccharomycescerevisiae 5 Met Asp Asn Gly Thr Asp Ser Ser Thr Ser Lys Phe Val Pro GluTyr 1 5 10 15 Arg Arg Thr Asn Phe Lys Asn Lys Gly Arg Phe Ser Ala AspGlu Leu 20 25 30 Arg Arg Arg Arg Asp Thr Gln Gln Val Glu Leu Arg Lys AlaLys Arg 35 40 45 Asp Glu Ala Leu Ala Lys Arg Arg Asn Phe Ile Pro Pro ThrAsp Gly 50 55 60 Ala Asp Ser Asp Glu Glu Asp Glu Ser Ser Val Ser Ala AspGln Gln 65 70 75 80 Phe Tyr Ser Gln Leu Gln Gln Glu Leu Pro Gln Met ThrGln Gln Leu 85 90 95 Asn Ser Asp Asp Met Gln Glu Gln Leu Ser Ala Thr ValLys Phe Arg 100 105 110 Gln Ile Leu Ser Arg Glu His Arg Pro Pro Ile AspVal Val Ile Gln 115 120 125 Ala Gly Val Val Pro Arg Leu Val Glu Phe MetArg Glu Asn Gln Pro 130 135 140 Glu Met Leu Gln Leu Glu Ala Ala Trp AlaLeu Thr Asn Ile Ala Ser 145 150 155 160 Gly Thr Ser Ala Gln Thr Lys ValVal Val Asp Ala Asp Ala Val Pro 165 170 175 Leu Phe Ile Gln Leu Leu TyrThr Gly Ser Val Glu Val Lys Glu Gln 180 185 190 Ala Ile Trp Ala Leu GlyAsn Val Ala Gly Asp Ser Thr Asp Tyr Arg 195 200 205 Asp Tyr Val Leu GlnCys Asn Ala Met Glu Pro Ile Leu Gly Leu Phe 210 215 220 Asn Ser Asn LysPro Ser Leu Ile Arg Thr Ala Thr Trp Thr Leu Ser 225 230 235 240 Asn LeuCys Arg Gly Lys Lys Pro Gln Pro Asp Trp Ser Val Val Ser 245 250 255 GlnAla Leu Pro Thr Leu Ala Lys Leu Ile Tyr Ser Met Asp Thr Glu 260 265 270Thr Leu Val Asp Ala Cys Trp Ala Ile Ser Tyr Leu Ser Asp Gly Pro 275 280285 Gln Glu Ala Ile Gln Ala Val Ile Asp Val Arg Ile Pro Lys Arg Leu 290295 300 Val Glu Leu Leu Ser His Glu Ser Thr Leu Val Gln Thr Pro Ala Leu305 310 315 320 Arg Ala Val Gly Asn Ile Val Thr Gly Asn Asp Leu Gln ThrGln Val 325 330 335 Val Ile Asn Ala Gly Val Leu Pro Ala Leu Arg Leu LeuLeu Ser Ser 340 345 350 Pro Lys Glu Asn Ile Lys Lys Glu Ala Cys Trp ThrIle Ser Asn Ile 355 360 365 Thr Ala Gly Asn Thr Glu Gln Ile Gln Ala ValIle Asp Ala Asn Leu 370 375 380 Ile Pro Pro Leu Val Lys Leu Leu Glu ValAla Glu Tyr Lys Thr Lys 385 390 395 400 Lys Glu Ala Cys Trp Ala Ile SerAsn Ala Ser Ser Gly Gly Leu Gln 405 410 415 Arg Pro Asp Ile Ile Arg TyrLeu Val Ser Gln Gly Cys Ile Lys Pro 420 425 430 Leu Cys Asp Leu Leu GluIle Ala Asp Asn Arg Ile Ile Glu Val Thr 435 440 445 Leu Asp Ala Leu GluAsn Ile Leu Lys Met Gly Glu Ala Asp Lys Glu 450 455 460 Ala Arg Gly LeuAsn Ile Asn Glu Asn Ala Asp Phe Ile Glu Lys Ala 465 470 475 480 Gly GlyMet Glu Lys Ile Phe Asn Cys Gln Gln Asn Glu Asn Asp Lys 485 490 495 IleTyr Glu Lys Ala Tyr Lys Ile Ile Glu Thr Tyr Phe Gly Glu Glu 500 505 510Glu Asp Ala Val Asp Glu Thr Met Ala Pro Gln Asn Ala Gly Asn Thr 515 520525 Phe Gly Phe Gly Ser Asn Val Asn Gln Gln Phe Asn Phe Asn 530 535 5406 170 DNA Saccharomyces cerevisiae 6 ggaggcaccg aagggcagcg ccgagtcggagggggcgaag attgacgcca gtaagaacga 60 ggaggatgaa ggccattcaa actcctccccacgacactct gaagcagcga cggcacagcg 120 ggaagaatgg aaaatgttta taggaggccttagctgggac actacaaaga 170 7 1827 DNA Homo sapiens CDS (1)...(1359) 7 gaggtc aat gtg gag ctg agg aaa gct aag aag gat gac cag atg ctg 48 Glu ValAsn Val Glu Leu Arg Lys Ala Lys Lys Asp Asp Gln Met Leu 1 5 10 15 aagagg aga aat gta agc tca ttt cct gat gat gct act tct ccg ctg 96 Lys ArgArg Asn Val Ser Ser Phe Pro Asp Asp Ala Thr Ser Pro Leu 20 25 30 cag gaaaac cgc aac aac cag ggc act gta aat tgg tct gtt gat gac 144 Gln Glu AsnArg Asn Asn Gln Gly Thr Val Asn Trp Ser Val Asp Asp 35 40 45 att gtc aaaggc ata aat agc agc aat gtg gaa aat cag ctc caa gct 192 Ile Val Lys GlyIle Asn Ser Ser Asn Val Glu Asn Gln Leu Gln Ala 50 55 60 act caa gct gccagg aaa cta ctt tcc aga gaa aaa cag ccc ccc ata 240 Thr Gln Ala Ala ArgLys Leu Leu Ser Arg Glu Lys Gln Pro Pro Ile 65 70 75 80 gac aac ata atccgg gct ggt ttg att ccg aaa ttt gtg tcc ttc ttg 288 Asp Asn Ile Ile ArgAla Gly Leu Ile Pro Lys Phe Val Ser Phe Leu 85 90 95 ggc aga act gat tgtagt ccc att cag ttt gaa tct gct tgg gca ctc 336 Gly Arg Thr Asp Cys SerPro Ile Gln Phe Glu Ser Ala Trp Ala Leu 100 105 110 act aac att gct tctggg aca tca gaa caa acc aag gct gtg gta gat 384 Thr Asn Ile Ala Ser GlyThr Ser Glu Gln Thr Lys Ala Val Val Asp 115 120 125 gga ggt gcc atc ccagca ttc att tct ctg ttg gca tct ccc cat gct 432 Gly Gly Ala Ile Pro AlaPhe Ile Ser Leu Leu Ala Ser Pro His Ala 130 135 140 cac atc agt gaa caagct gtc tgg gct cta gga aac att gca ggt gat 480 His Ile Ser Glu Gln AlaVal Trp Ala Leu Gly Asn Ile Ala Gly Asp 145 150 155 160 ggc tca gtg ttccga gac ttg gtt att aag tac ggt gca gtt gac cca 528 Gly Ser Val Phe ArgAsp Leu Val Ile Lys Tyr Gly Ala Val Asp Pro 165 170 175 ctg ttg gct ctcctt gca gtt cct gat atg tca tct tta gca tgt ggc 576 Leu Leu Ala Leu LeuAla Val Pro Asp Met Ser Ser Leu Ala Cys Gly 180 185 190 tac tta cgt aatctt acc tgg aca ctt tct aat ctt tgc cgc aac aag 624 Tyr Leu Arg Asn LeuThr Trp Thr Leu Ser Asn Leu Cys Arg Asn Lys 195 200 205 aat cct gca cccccg ata gat gct gtt gag cag att ctt cct acc tta 672 Asn Pro Ala Pro ProIle Asp Ala Val Glu Gln Ile Leu Pro Thr Leu 210 215 220 gtt cgg ctc ctgcat cat gat gat cca gaa gtg tta gca gat acc tgc 720 Val Arg Leu Leu HisHis Asp Asp Pro Glu Val Leu Ala Asp Thr Cys 225 230 235 240 tgg gct atttcc tac ctt act gat ggt cca aat gaa cga att ggc atg 768 Trp Ala Ile SerTyr Leu Thr Asp Gly Pro Asn Glu Arg Ile Gly Met 245 250 255 gtg gtg aaaaca gga gtt gtg ccc caa ctt gtg aag ctt cta gga gct 816 Val Val Lys ThrGly Val Val Pro Gln Leu Val Lys Leu Leu Gly Ala 260 265 270 tct gaa ttgcca att gtg act cct gcc cta aga gcc ata ggg aat att 864 Ser Glu Leu ProIle Val Thr Pro Ala Leu Arg Ala Ile Gly Asn Ile 275 280 285 gtc act ggtaca gat gaa cag act cag gtt gtg att gat gca gga gca 912 Val Thr Gly ThrAsp Glu Gln Thr Gln Val Val Ile Asp Ala Gly Ala 290 295 300 ctc gcc gtcttt ccc agc ctg ctc acc aac ccc aaa act aac att cag 960 Leu Ala Val PhePro Ser Leu Leu Thr Asn Pro Lys Thr Asn Ile Gln 305 310 315 320 aag gaagct acg tgg aca atg tca aac atc aca gcc ggc cgc cag gac 1008 Lys Glu AlaThr Trp Thr Met Ser Asn Ile Thr Ala Gly Arg Gln Asp 325 330 335 cag atacag caa gtt gtg aat cat gga tta gtc cca ttc ctt gtc agt 1056 Gln Ile GlnGln Val Val Asn His Gly Leu Val Pro Phe Leu Val Ser 340 345 350 gtt ctctct aag gca gat ttt aag aca caa aag gaa gct gtg tgg gcc 1104 Val Leu SerLys Ala Asp Phe Lys Thr Gln Lys Glu Ala Val Trp Ala 355 360 365 gtg accaac tat acc agt ggt gga aca gtt gaa cag att gtg tac ctt 1152 Val Thr AsnTyr Thr Ser Gly Gly Thr Val Glu Gln Ile Val Tyr Leu 370 375 380 gtt cactgt ggc ata ata gaa ccg ttg atg aac ctc tta act gca aaa 1200 Val His CysGly Ile Ile Glu Pro Leu Met Asn Leu Leu Thr Ala Lys 385 390 395 400 gatacc aag att att ctg gtt atc ctg gat gcc att tca aat atc ttt 1248 Asp ThrLys Ile Ile Leu Val Ile Leu Asp Ala Ile Ser Asn Ile Phe 405 410 415 caggct gct gag aaa cta ggt gaa act agc tgc ccg tct tca cag att 1296 Gln AlaAla Glu Lys Leu Gly Glu Thr Ser Cys Pro Ser Ser Gln Ile 420 425 430 caagaa caa ggg aaa aga cag tac aga aat gag gcg tcc gag gcg tcg 1344 Gln GluGln Gly Lys Arg Gln Tyr Arg Asn Glu Ala Ser Glu Ala Ser 435 440 445 cagaat aga gaa act tagtataatg attgaagaat gtggaggctt agacaaaatt 1399 Gln AsnArg Glu Thr 450 gaagctctac aaaaccatga aaatgagtct gtgtataagg cttcgttaagcttaattgag 1459 aagtatttct ctgtagagga agaggaagat caaaacgttg taccagaaactacctctgaa 1519 ggctacactt tccaagttca ggatggggct cctgggacct ttaacttttagatcatgtag 1579 ctgagacata aatttgttgt gtactacgtt tggtattttg tcttattgtttctctactaa 1639 gaactctttc ttaaatgtgg tttgttactg tagcactttt tacactgaaactatacttga 1699 acagttccaa ctgtacatac atactgtatg aagcttgtcc tctgactaggtttctaattt 1759 ctatgtggaa tttcctatct tgcagcatcc tgtaaataaa cattcaagtccacccttttc 1819 ttgacttc 1827 8 453 PRT Homo sapiens 8 Glu Val Asn ValGlu Leu Arg Lys Ala Lys Lys Asp Asp Gln Met Leu 1 5 10 15 Lys Arg ArgAsn Val Ser Ser Phe Pro Asp Asp Ala Thr Ser Pro Leu 20 25 30 Gln Glu AsnArg Asn Asn Gln Gly Thr Val Asn Trp Ser Val Asp Asp 35 40 45 Ile Val LysGly Ile Asn Ser Ser Asn Val Glu Asn Gln Leu Gln Ala 50 55 60 Thr Gln AlaAla Arg Lys Leu Leu Ser Arg Glu Lys Gln Pro Pro Ile 65 70 75 80 Asp AsnIle Ile Arg Ala Gly Leu Ile Pro Lys Phe Val Ser Phe Leu 85 90 95 Gly ArgThr Asp Cys Ser Pro Ile Gln Phe Glu Ser Ala Trp Ala Leu 100 105 110 ThrAsn Ile Ala Ser Gly Thr Ser Glu Gln Thr Lys Ala Val Val Asp 115 120 125Gly Gly Ala Ile Pro Ala Phe Ile Ser Leu Leu Ala Ser Pro His Ala 130 135140 His Ile Ser Glu Gln Ala Val Trp Ala Leu Gly Asn Ile Ala Gly Asp 145150 155 160 Gly Ser Val Phe Arg Asp Leu Val Ile Lys Tyr Gly Ala Val AspPro 165 170 175 Leu Leu Ala Leu Leu Ala Val Pro Asp Met Ser Ser Leu AlaCys Gly 180 185 190 Tyr Leu Arg Asn Leu Thr Trp Thr Leu Ser Asn Leu CysArg Asn Lys 195 200 205 Asn Pro Ala Pro Pro Ile Asp Ala Val Glu Gln IleLeu Pro Thr Leu 210 215 220 Val Arg Leu Leu His His Asp Asp Pro Glu ValLeu Ala Asp Thr Cys 225 230 235 240 Trp Ala Ile Ser Tyr Leu Thr Asp GlyPro Asn Glu Arg Ile Gly Met 245 250 255 Val Val Lys Thr Gly Val Val ProGln Leu Val Lys Leu Leu Gly Ala 260 265 270 Ser Glu Leu Pro Ile Val ThrPro Ala Leu Arg Ala Ile Gly Asn Ile 275 280 285 Val Thr Gly Thr Asp GluGln Thr Gln Val Val Ile Asp Ala Gly Ala 290 295 300 Leu Ala Val Phe ProSer Leu Leu Thr Asn Pro Lys Thr Asn Ile Gln 305 310 315 320 Lys Glu AlaThr Trp Thr Met Ser Asn Ile Thr Ala Gly Arg Gln Asp 325 330 335 Gln IleGln Gln Val Val Asn His Gly Leu Val Pro Phe Leu Val Ser 340 345 350 ValLeu Ser Lys Ala Asp Phe Lys Thr Gln Lys Glu Ala Val Trp Ala 355 360 365Val Thr Asn Tyr Thr Ser Gly Gly Thr Val Glu Gln Ile Val Tyr Leu 370 375380 Val His Cys Gly Ile Ile Glu Pro Leu Met Asn Leu Leu Thr Ala Lys 385390 395 400 Asp Thr Lys Ile Ile Leu Val Ile Leu Asp Ala Ile Ser Asn IlePhe 405 410 415 Gln Ala Ala Glu Lys Leu Gly Glu Thr Ser Cys Pro Ser SerGln Ile 420 425 430 Gln Glu Gln Gly Lys Arg Gln Tyr Arg Asn Glu Ala SerGlu Ala Ser 435 440 445 Gln Asn Arg Glu Thr 450 9 259 DNA Homo sapiens 9gaacgaccaa gagggtgttc gactgctaga gccgagcaga agcgtgccta aatcaaagga 60acttgtttct tcaagctctt ctggcagtga ttctgacagt gaggttgaca aaaagttaag 120caggaaaaag caagttgctc cagaaaaacc tgtaaagaaa caaaagacag gtgagacttc 180gagagccctg tcatcttcta aacagagcag cagcagcaga gatgataaca tgtttcagat 240tgggaaaatg aggtcagtt 259 10 221 DNA Homo sapiens 10 tgtcgactgtggctttgagc atccgtcaga agtccagcat gagtgcatcc ctcaggccat 60 tctgggaatggatgtcctgt gccaggccaa gtcgggcatg ggaaagacag cagtgtttgt 120 cttggccacactgcaacagc tggagccagt tactgggcag gtgtctgtac tggtgatgtg 180 tcacactcgggagttggctt ttcagatcag caaggaatat g 221 11 372 DNA Homo sapiens 11atttgtaaac cccggagcga ggttctgctt acccgaggcc gctgctgtgc ggagaccccc 60gggtgaagcc accgtcatca tgtctgacca ggaggcaaaa ccttcaactg aggacttggg 120ggataagaag gaaggtgaat atattaaact caaagtcatt ggacaggata gcagtgagat 180tcacttcaaa gtgaaaatga caacacatct caagaaactc aaagaatcat actgtcaaag 240acagggtgtt ccaatgaatt cactcaggtt tctctttgag ggtcagagaa ttgctgataa 300tcatactcca aaagaactgg gaatggagga agaagttgtg attgaagttt atcaggaaca 360aacggggggt ca 372 12 2675 DNA Homo sapiens CDS (104)...(2310) 12tctgaccctc gtcccgcccc cgccattcgc cgcctcctcc tgtcccgcag tcggcgtcca 60gcggctctgc ttgttcgtgt gtgtgtcgtt gcaggcctta ttc atg ggc tca ccg 115 MetGly Ser Pro 1 ctg agg ttc gac ggg cgg gtg gta ctg gtc acc ggc gcg ggggca gga 163 Leu Arg Phe Asp Gly Arg Val Val Leu Val Thr Gly Ala Gly AlaGly 5 10 15 20 ttg ggc cga gcc tat gcc ctg gct ttt gca gaa aga gga gcgtta gtt 211 Leu Gly Arg Ala Tyr Ala Leu Ala Phe Ala Glu Arg Gly Ala LeuVal 25 30 35 gtt gtg aat gat ttg gga ggg cac ttc aaa gga gtt ggt aaa ggctcc 259 Val Val Asn Asp Leu Gly Gly His Phe Lys Gly Val Gly Lys Gly Ser40 45 50 tta gct gat aag gtt gtt gaa gaa ata aga agg aga ggt gga aaa gca307 Leu Ala Asp Lys Val Val Glu Glu Ile Arg Arg Arg Gly Gly Lys Ala 5560 65 gtg gcc aac tat gat tca gtg gaa gaa gga gag aag gtt gtg aag aca355 Val Ala Asn Tyr Asp Ser Val Glu Glu Gly Glu Lys Val Val Lys Thr 7075 80 gcc ctg gat gct ttt gga aga ata gat gtt gtg gtc aac aat gct gga403 Ala Leu Asp Ala Phe Gly Arg Ile Asp Val Val Val Asn Asn Ala Gly 8590 95 100 att ctg agg gat cat tcc ttt gct agg ata agt gat gaa gac tgggat 451 Ile Leu Arg Asp His Ser Phe Ala Arg Ile Ser Asp Glu Asp Trp Asp105 110 115 ata atc cac aga gtt cat ttg cgg ggt tca ttc caa gtg aca cgggca 499 Ile Ile His Arg Val His Leu Arg Gly Ser Phe Gln Val Thr Arg Ala120 125 130 gca tgg gaa cac atg aag aaa cag aag tat gga agg att att atgact 547 Ala Trp Glu His Met Lys Lys Gln Lys Tyr Gly Arg Ile Ile Met Thr135 140 145 tca tca gct tca gga ata tat ggc aac ttt ggc cag gcc aat tatagt 595 Ser Ser Ala Ser Gly Ile Tyr Gly Asn Phe Gly Gln Ala Asn Tyr Ser150 155 160 gct gca aag ttg ggt ctt ctg ggc ctt gca aat tct ctt gca attgaa 643 Ala Ala Lys Leu Gly Leu Leu Gly Leu Ala Asn Ser Leu Ala Ile Glu165 170 175 180 ggc agg aaa agc aac att cat tgt aac acc att gct cct aatgcg gga 691 Gly Arg Lys Ser Asn Ile His Cys Asn Thr Ile Ala Pro Asn AlaGly 185 190 195 tca cgg atg act cag aca gtt atg cct gaa gat ctt gtg gaagcc ttg 739 Ser Arg Met Thr Gln Thr Val Met Pro Glu Asp Leu Val Glu AlaLeu 200 205 210 aag cca gag tat gtg gca cct ctt gtc ctt tgg ctt tgt cacgag agt 787 Lys Pro Glu Tyr Val Ala Pro Leu Val Leu Trp Leu Cys His GluSer 215 220 225 tgt gag gag aat ggt ggc ttg ttt gag gtt ggt gca gga tggatt gga 835 Cys Glu Glu Asn Gly Gly Leu Phe Glu Val Gly Ala Gly Trp IleGly 230 235 240 aaa tta cgc tgg gag cgg act ctt gga gct att gta aga caaaag aat 883 Lys Leu Arg Trp Glu Arg Thr Leu Gly Ala Ile Val Arg Gln LysAsn 245 250 255 260 cac cca atg act cct gag gca gtc aag gct aac tgg aagaag atc tgt 931 His Pro Met Thr Pro Glu Ala Val Lys Ala Asn Trp Lys LysIle Cys 265 270 275 gac ttt gag aat gcc agc aag cct cag agt atc caa gaatca act ggc 979 Asp Phe Glu Asn Ala Ser Lys Pro Gln Ser Ile Gln Glu SerThr Gly 280 285 290 agt ata att gaa gtt ctg agt aaa ata gat tca gaa ggagga gtt tca 1027 Ser Ile Ile Glu Val Leu Ser Lys Ile Asp Ser Glu Gly GlyVal Ser 295 300 305 gca aat cat act agt cgt gca acg tct aca gca aca tcagga ttt gct 1075 Ala Asn His Thr Ser Arg Ala Thr Ser Thr Ala Thr Ser GlyPhe Ala 310 315 320 gga gct att ggc cag aaa ctc cct cca ttt tct tat gcttat acg gaa 1123 Gly Ala Ile Gly Gln Lys Leu Pro Pro Phe Ser Tyr Ala TyrThr Glu 325 330 335 340 ctg gaa gct att atg tat gcc ctt gga gtg gga gcgtca atc aag gat 1171 Leu Glu Ala Ile Met Tyr Ala Leu Gly Val Gly Ala SerIle Lys Asp 345 350 355 cca aaa gat ttg aaa ttt att tat gaa gga agt tctgat ttc tcc tgt 1219 Pro Lys Asp Leu Lys Phe Ile Tyr Glu Gly Ser Ser AspPhe Ser Cys 360 365 370 ttg ccc acc ttc gga gtt atc ata ggt cag aaa tctatg atg ggt gga 1267 Leu Pro Thr Phe Gly Val Ile Ile Gly Gln Lys Ser MetMet Gly Gly 375 380 385 gga tta gca gaa att cct gga ctt tca atc aac tttgca aag gtt ctt 1315 Gly Leu Ala Glu Ile Pro Gly Leu Ser Ile Asn Phe AlaLys Val Leu 390 395 400 cat gga gag cag tac tta gag tta tat aaa cca cttccc aga gca gga 1363 His Gly Glu Gln Tyr Leu Glu Leu Tyr Lys Pro Leu ProArg Ala Gly 405 410 415 420 aaa tta aaa tgt gaa gca gtt gtt gct gat gtccta gat aaa gga tcc 1411 Lys Leu Lys Cys Glu Ala Val Val Ala Asp Val LeuAsp Lys Gly Ser 425 430 435 ggt gta gtg att att atg gat gtc tat tct tattct gag aag gaa ctt 1459 Gly Val Val Ile Ile Met Asp Val Tyr Ser Tyr SerGlu Lys Glu Leu 440 445 450 ata tgc cac aat cag ttc tct ctc ttt ctt gttggc tct gga ggc ttt 1507 Ile Cys His Asn Gln Phe Ser Leu Phe Leu Val GlySer Gly Gly Phe 455 460 465 ggt gga aaa cgg aca tca gac aaa gtc aag gtagct gta gcc ata cct 1555 Gly Gly Lys Arg Thr Ser Asp Lys Val Lys Val AlaVal Ala Ile Pro 470 475 480 aat aga cct cct gat gct gta ctt aca gat accacc tct ctt aat cag 1603 Asn Arg Pro Pro Asp Ala Val Leu Thr Asp Thr ThrSer Leu Asn Gln 485 490 495 500 gct gct ttg tac cgc ctc agt gga gac cggaat ccc tta cac att gat 1651 Ala Ala Leu Tyr Arg Leu Ser Gly Asp Arg AsnPro Leu His Ile Asp 505 510 515 cct aac ttt gct agt cta gca ggt ttt gacaag ccc ata tta cat gga 1699 Pro Asn Phe Ala Ser Leu Ala Gly Phe Asp LysPro Ile Leu His Gly 520 525 530 tta tgt aca ttt gga ttt tct gcc agg cgtgtg tta cag cag ttt gca 1747 Leu Cys Thr Phe Gly Phe Ser Ala Arg Arg ValLeu Gln Gln Phe Ala 535 540 545 gat aat gat gtg tca aga ttc aag gca gttaag gct cgt ttt gca aaa 1795 Asp Asn Asp Val Ser Arg Phe Lys Ala Val LysAla Arg Phe Ala Lys 550 555 560 cca gta tat cca gga caa act cta caa actgag atg tgg aag gaa gga 1843 Pro Val Tyr Pro Gly Gln Thr Leu Gln Thr GluMet Trp Lys Glu Gly 565 570 575 580 aac aga att cat ttt caa acc aag gtccaa gaa act gga gac att gtc 1891 Asn Arg Ile His Phe Gln Thr Lys Val GlnGlu Thr Gly Asp Ile Val 585 590 595 att tca aat gca tat gtg gat ctt gcacca aca tct ggt act tca gct 1939 Ile Ser Asn Ala Tyr Val Asp Leu Ala ProThr Ser Gly Thr Ser Ala 600 605 610 aag aca ccc tct gag ggc ggg aag cttcag agt acc ttt gta ttt gag 1987 Lys Thr Pro Ser Glu Gly Gly Lys Leu GlnSer Thr Phe Val Phe Glu 615 620 625 gaa ata gga cgc cgc cta aag gat attggg cct gag gtg gtg aag aaa 2035 Glu Ile Gly Arg Arg Leu Lys Asp Ile GlyPro Glu Val Val Lys Lys 630 635 640 gta aat gct gta ttt gag tgg cat ataacc aaa ggc gga aat att ggg 2083 Val Asn Ala Val Phe Glu Trp His Ile ThrLys Gly Gly Asn Ile Gly 645 650 655 660 gct aag tgg act att gac ctg aaaagt ggt tct gga aaa gtg tac caa 2131 Ala Lys Trp Thr Ile Asp Leu Lys SerGly Ser Gly Lys Val Tyr Gln 665 670 675 ggc cct gca aaa ggt gct gct gataca aca atc ata ctt tca gat gaa 2179 Gly Pro Ala Lys Gly Ala Ala Asp ThrThr Ile Ile Leu Ser Asp Glu 680 685 690 gat ttc atg gag gtg gtc ctg ggcaag ctt gac cct cag aag gca ttc 2227 Asp Phe Met Glu Val Val Leu Gly LysLeu Asp Pro Gln Lys Ala Phe 695 700 705 ttt agt ggc agg ctg aag gcc agaggg aac atc atg ctg agc cag aaa 2275 Phe Ser Gly Arg Leu Lys Ala Arg GlyAsn Ile Met Leu Ser Gln Lys 710 715 720 ctt cag atg att ctt aaa gac tacgcc aag ctc tg aagggcacac 2320 Leu Gln Met Ile Leu Lys Asp Tyr Ala LysLeu 725 730 735 tacactatta ataaaaatgg aatcattaaa tactctcttc acccaaatatgcttgattat 2380 tctgcaaaag tgattagaac taagatgcag gggaaattgc ttaacattttcagatatcag 2440 ataactgcag attttcattt tctactaatt tttcatgtat cattatttttacaaggaact 2500 atatataagc tagcacataa ttatccttct gttcttagat ctgtatcttcataataaaaa 2560 aattttgccc aagtcctgtt tccttagaat ttgtgatagc attgataagttgaaaggaaa 2620 attaaatcaa taaaggcctt tgataccttt aaaaaaaaaa aaaaaaaaaaaaaaa 2675 13 735 PRT Homo sapiens 13 Met Gly Ser Pro Leu Arg Phe AspGly Arg Val Val Leu Val Thr Gly 1 5 10 15 Ala Gly Ala Gly Leu Gly ArgAla Tyr Ala Leu Ala Phe Ala Glu Arg 20 25 30 Gly Ala Leu Val Val Val AsnAsp Leu Gly Gly His Phe Lys Gly Val 35 40 45 Gly Lys Gly Ser Leu Ala AspLys Val Val Glu Glu Ile Arg Arg Arg 50 55 60 Gly Gly Lys Ala Val Ala AsnTyr Asp Ser Val Glu Glu Gly Glu Lys 65 70 75 80 Val Val Lys Thr Ala LeuAsp Ala Phe Gly Arg Ile Asp Val Val Val 85 90 95 Asn Asn Ala Gly Ile LeuArg Asp His Ser Phe Ala Arg Ile Ser Asp 100 105 110 Glu Asp Trp Asp IleIle His Arg Val His Leu Arg Gly Ser Phe Gln 115 120 125 Val Thr Arg AlaAla Trp Glu His Met Lys Lys Gln Lys Tyr Gly Arg 130 135 140 Ile Ile MetThr Ser Ser Ala Ser Gly Ile Tyr Gly Asn Phe Gly Gln 145 150 155 160 AlaAsn Tyr Ser Ala Ala Lys Leu Gly Leu Leu Gly Leu Ala Asn Ser 165 170 175Leu Ala Ile Glu Gly Arg Lys Ser Asn Ile His Cys Asn Thr Ile Ala 180 185190 Pro Asn Ala Gly Ser Arg Met Thr Gln Thr Val Met Pro Glu Asp Leu 195200 205 Val Glu Ala Leu Lys Pro Glu Tyr Val Ala Pro Leu Val Leu Trp Leu210 215 220 Cys His Glu Ser Cys Glu Glu Asn Gly Gly Leu Phe Glu Val GlyAla 225 230 235 240 Gly Trp Ile Gly Lys Leu Arg Trp Glu Arg Thr Leu GlyAla Ile Val 245 250 255 Arg Gln Lys Asn His Pro Met Thr Pro Glu Ala ValLys Ala Asn Trp 260 265 270 Lys Lys Ile Cys Asp Phe Glu Asn Ala Ser LysPro Gln Ser Ile Gln 275 280 285 Glu Ser Thr Gly Ser Ile Ile Glu Val LeuSer Lys Ile Asp Ser Glu 290 295 300 Gly Gly Val Ser Ala Asn His Thr SerArg Ala Thr Ser Thr Ala Thr 305 310 315 320 Ser Gly Phe Ala Gly Ala IleGly Gln Lys Leu Pro Pro Phe Ser Tyr 325 330 335 Ala Tyr Thr Glu Leu GluAla Ile Met Tyr Ala Leu Gly Val Gly Ala 340 345 350 Ser Ile Lys Asp ProLys Asp Leu Lys Phe Ile Tyr Glu Gly Ser Ser 355 360 365 Asp Phe Ser CysLeu Pro Thr Phe Gly Val Ile Ile Gly Gln Lys Ser 370 375 380 Met Met GlyGly Gly Leu Ala Glu Ile Pro Gly Leu Ser Ile Asn Phe 385 390 395 400 AlaLys Val Leu His Gly Glu Gln Tyr Leu Glu Leu Tyr Lys Pro Leu 405 410 415Pro Arg Ala Gly Lys Leu Lys Cys Glu Ala Val Val Ala Asp Val Leu 420 425430 Asp Lys Gly Ser Gly Val Val Ile Ile Met Asp Val Tyr Ser Tyr Ser 435440 445 Glu Lys Glu Leu Ile Cys His Asn Gln Phe Ser Leu Phe Leu Val Gly450 455 460 Ser Gly Gly Phe Gly Gly Lys Arg Thr Ser Asp Lys Val Lys ValAla 465 470 475 480 Val Ala Ile Pro Asn Arg Pro Pro Asp Ala Val Leu ThrAsp Thr Thr 485 490 495 Ser Leu Asn Gln Ala Ala Leu Tyr Arg Leu Ser GlyAsp Arg Asn Pro 500 505 510 Leu His Ile Asp Pro Asn Phe Ala Ser Leu AlaGly Phe Asp Lys Pro 515 520 525 Ile Leu His Gly Leu Cys Thr Phe Gly PheSer Ala Arg Arg Val Leu 530 535 540 Gln Gln Phe Ala Asp Asn Asp Val SerArg Phe Lys Ala Val Lys Ala 545 550 555 560 Arg Phe Ala Lys Pro Val TyrPro Gly Gln Thr Leu Gln Thr Glu Met 565 570 575 Trp Lys Glu Gly Asn ArgIle His Phe Gln Thr Lys Val Gln Glu Thr 580 585 590 Gly Asp Ile Val IleSer Asn Ala Tyr Val Asp Leu Ala Pro Thr Ser 595 600 605 Gly Thr Ser AlaLys Thr Pro Ser Glu Gly Gly Lys Leu Gln Ser Thr 610 615 620 Phe Val PheGlu Glu Ile Gly Arg Arg Leu Lys Asp Ile Gly Pro Glu 625 630 635 640 ValVal Lys Lys Val Asn Ala Val Phe Glu Trp His Ile Thr Lys Gly 645 650 655Gly Asn Ile Gly Ala Lys Trp Thr Ile Asp Leu Lys Ser Gly Ser Gly 660 665670 Lys Val Tyr Gln Gly Pro Ala Lys Gly Ala Ala Asp Thr Thr Ile Ile 675680 685 Leu Ser Asp Glu Asp Phe Met Glu Val Val Leu Gly Lys Leu Asp Pro690 695 700 Gln Lys Ala Phe Phe Ser Gly Arg Leu Lys Ala Arg Gly Asn IleMet 705 710 715 720 Leu Ser Gln Lys Leu Gln Met Ile Leu Lys Asp Tyr AlaLys Leu 725 730 735 14 2752 DNA Homo sapiens CDS (538)...(2394) 14agtgtctccc ggtcgcgcgt ggaggtcggt cgctcagagc tgctgggcgc agtttctccg 60cctgctgctt cggcgcggct gtatcggcga gcgagcgagt tcccgcgagt tctcggtggc 120gctccccctt cctttcagtc tccacggact ggcccctcgt ccttctactt gaccgctccc 180gtcttctgcc gccttctggc gctttccgtt gggccgattc ccgcccgctt cctcctgctt 240cccatcgaag ctctagaaat gaatgtttcc atctcttcag agatgaacca gattatgatg 300catcattatc acagaagaaa ttcgtgtcta tagcttttaa ggacttgatt acatcatttt 360caagcctgat agttttggaa tcaccattag agcttaagac acacctgcct tcatttcaac 420cacctgtctt cataccctga cgaagtgcac cttttaacac tcctttgtcc ttggattact 480taagagttcc cagaaataca tttgccacca acagagtagc caaatttata aggaaaa atg 540Met 1 att ccc aat gga tat ttg atg ttt gag gat gaa aat ttt att gag tct588 Ile Pro Asn Gly Tyr Leu Met Phe Glu Asp Glu Asn Phe Ile Glu Ser 5 1015 tct gtt gcc aaa tta aat gcc ctg agg aaa agt ggc cag ttc tgt gat 636Ser Val Ala Lys Leu Asn Ala Leu Arg Lys Ser Gly Gln Phe Cys Asp 20 25 30gtt cga ctt cag gtc tgt ggc cat gaa atg tta gca cac aga gca gtg 684 ValArg Leu Gln Val Cys Gly His Glu Met Leu Ala His Arg Ala Val 35 40 45 ctagct tgc tgc agt ccc tat tta ttt gaa atc ttt aat agt gat agt 732 Leu AlaCys Cys Ser Pro Tyr Leu Phe Glu Ile Phe Asn Ser Asp Ser 50 55 60 65 gatcct cat gga att tct cac gtt aaa ttt gat gat ctc aat cca gaa 780 Asp ProHis Gly Ile Ser His Val Lys Phe Asp Asp Leu Asn Pro Glu 70 75 80 gct gttgaa gtc ttg ttg aat tat gcc tac act gct cag ttg aaa gca 828 Ala Val GluVal Leu Leu Asn Tyr Ala Tyr Thr Ala Gln Leu Lys Ala 85 90 95 gat aag gaattg gta aaa gat gtt tat tct gca gca aaa gag ctg aag 876 Asp Lys Glu LeuVal Lys Asp Val Tyr Ser Ala Ala Lys Glu Leu Lys 100 105 110 atg gat cgagta aag cag gtt tgt ggt gat tat tta ctg tct aga atg 924 Met Asp Arg ValLys Gln Val Cys Gly Asp Tyr Leu Leu Ser Arg Met 115 120 125 gat gtt accagc tgc atc tct tac cga aat ttt gca agt tgt atg gga 972 Asp Val Thr SerCys Ile Ser Tyr Arg Asn Phe Ala Ser Cys Met Gly 130 135 140 145 gac tcccat ttg ttg aat aag gtt gat gct tat att cag gag cat ttg 1020 Asp Ser HisLeu Leu Asn Lys Val Asp Ala Tyr Ile Gln Glu His Leu 150 155 160 tta caaatt tct gaa gag gag gag ttt ctt aag ctt cca agg cta aag 1068 Leu Gln IleSer Glu Glu Glu Glu Phe Leu Lys Leu Pro Arg Leu Lys 165 170 175 ttg gaggta atg ctt gaa gat aat gtt tgc ttg ccc agc aat ggc aaa 1116 Leu Glu ValMet Leu Glu Asp Asn Val Cys Leu Pro Ser Asn Gly Lys 180 185 190 tta tataca aag gta atc aac tgg gtg cag cgt agc atc tgg gag aat 1164 Leu Tyr ThrLys Val Ile Asn Trp Val Gln Arg Ser Ile Trp Glu Asn 195 200 205 gga gacagt ctg gaa gag ctg atg gaa gag gtt caa acc ttg tac tac 1212 Gly Asp SerLeu Glu Glu Leu Met Glu Glu Val Gln Thr Leu Tyr Tyr 210 215 220 225 tcagct gat cac aag ctg ctt gat ggg aac cta cta gat gga cag gct 1260 Ser AlaAsp His Lys Leu Leu Asp Gly Asn Leu Leu Asp Gly Gln Ala 230 235 240 gaggtg ttt ggc agt gat gat gac cac att cag ttt gtg cag aaa aag 1308 Glu ValPhe Gly Ser Asp Asp Asp His Ile Gln Phe Val Gln Lys Lys 245 250 255 ccacca cgt gag aat ggc cat aag cag ata agt agc agt tca act gga 1356 Pro ProArg Glu Asn Gly His Lys Gln Ile Ser Ser Ser Ser Thr Gly 260 265 270 tgtctc tct tct cca aat gct aca gta caa agc cct aag cat gag tgg 1404 Cys LeuSer Ser Pro Asn Ala Thr Val Gln Ser Pro Lys His Glu Trp 275 280 285 aaaatc gtt gct tca gaa aag act tca aat aac act tac ttg tgc ctg 1452 Lys IleVal Ala Ser Glu Lys Thr Ser Asn Asn Thr Tyr Leu Cys Leu 290 295 300 305gct gtg ctg gat ggt ata ttc tgt gtc att ttt ctt cat ggg aga aac 1500 AlaVal Leu Asp Gly Ile Phe Cys Val Ile Phe Leu His Gly Arg Asn 310 315 320agc cca cag agc tca cca aca agt act cca aaa cta agt aag agt tta 1548 SerPro Gln Ser Ser Pro Thr Ser Thr Pro Lys Leu Ser Lys Ser Leu 325 330 335agc ttt gag atg caa caa gat gag cta atc gaa aag ccc atg tct cct 1596 SerPhe Glu Met Gln Gln Asp Glu Leu Ile Glu Lys Pro Met Ser Pro 340 345 350atg cag tac gca cga tct ggt ctg gga aca gca gag atg aat ggc aaa 1644 MetGln Tyr Ala Arg Ser Gly Leu Gly Thr Ala Glu Met Asn Gly Lys 355 360 365ctc ata gct gca ggt ggc tat aac aga gag gaa tgt ctt cga aca gtc 1692 LeuIle Ala Ala Gly Gly Tyr Asn Arg Glu Glu Cys Leu Arg Thr Val 370 375 380385 gaa tgc tat aat cca cat aca gat cac tgg tcc ttt ctt gct ccc atg 1740Glu Cys Tyr Asn Pro His Thr Asp His Trp Ser Phe Leu Ala Pro Met 390 395400 aga aca cca aga gcc cga ttt caa atg gct gta ctc atg ggc cag ctc 1788Arg Thr Pro Arg Ala Arg Phe Gln Met Ala Val Leu Met Gly Gln Leu 405 410415 tat gtg gta ggt gga tca aat ggc cac tca gat gac ctg agt tgt gga 1836Tyr Val Val Gly Gly Ser Asn Gly His Ser Asp Asp Leu Ser Cys Gly 420 425430 gag atg tat gat tca aac ata gat gac tgg att cct gtt cca gaa ttg 1884Glu Met Tyr Asp Ser Asn Ile Asp Asp Trp Ile Pro Val Pro Glu Leu 435 440445 aga act aac cgt tgt aat gca gga gtg tgt gct ctg aat gga aag tta 1932Arg Thr Asn Arg Cys Asn Ala Gly Val Cys Ala Leu Asn Gly Lys Leu 450 455460 465 tac atc gtt ggt ggc tct gat cca tat ggt caa aaa gga ctg aaa aat1980 Tyr Ile Val Gly Gly Ser Asp Pro Tyr Gly Gln Lys Gly Leu Lys Asn 470475 480 tgt gat gta ttt gat cct gta aca aag ttg tgg aca agc tgt gcc cct2028 Cys Asp Val Phe Asp Pro Val Thr Lys Leu Trp Thr Ser Cys Ala Pro 485490 495 ctt aac att cgg aga cac cag tct gca gtc tgt gag ctt ggt ggt tat2076 Leu Asn Ile Arg Arg His Gln Ser Ala Val Cys Glu Leu Gly Gly Tyr 500505 510 ttg tac ata atc gga ggt gca gaa tct tgg aat tgt ctg aac aca gta2124 Leu Tyr Ile Ile Gly Gly Ala Glu Ser Trp Asn Cys Leu Asn Thr Val 515520 525 gaa cga tac aat cct gaa aat aat acc tgg act tta att gca ccc atg2172 Glu Arg Tyr Asn Pro Glu Asn Asn Thr Trp Thr Leu Ile Ala Pro Met 530535 540 545 aat gtg gct agg cga gga gct gga gtg gct gtt ctt aat gga aaactg 2220 Asn Val Ala Arg Arg Gly Ala Gly Val Ala Val Leu Asn Gly Lys Leu550 555 560 ttt gta tgt ggt ggc ttt gat ggt tct cat gcc atc agt tgt gtggaa 2268 Phe Val Cys Gly Gly Phe Asp Gly Ser His Ala Ile Ser Cys Val Glu565 570 575 atg tat gat cca act aga aat gaa tgg aag atg atg gga cat atgact 2316 Met Tyr Asp Pro Thr Arg Asn Glu Trp Lys Met Met Gly His Met Thr580 585 590 tca cca agg agc aat gct ggg att gca act gta ggg aac acc atttat 2364 Ser Pro Arg Ser Asn Ala Gly Ile Ala Thr Val Gly Asn Thr Ile Tyr595 600 605 gca gtg gag gat tcg atg gca atg aat ttc tgaatacggcggaagtctat 2414 Ala Val Glu Asp Ser Met Ala Met Asn Phe 610 615aaccttgagt caaatgaatg gagcccctat acaaagattt tccagtttta acgggtttaa 2474gaccctctca aactaacagg cttagtgatg taattatggt tagcagaggt acacttgtga 2534ataaagaggg tgggtgggga tagatgttgc taacagcaac acaaagcttt tgcatattgc 2594atactattaa acatgctgta catacttttt gggtttattt gggaaggaat gcaaagatga 2654aggtctgttt gtgtactttt aagactttgg ttattttact ttttggaaaa gataaaccaa 2714gaattgattg ggcacatcaa aaaaaaaaaa aaaaaaaa 2752 15 619 PRT Homo sapiens15 Met Ile Pro Asn Gly Tyr Leu Met Phe Glu Asp Glu Asn Phe Ile Glu 1 510 15 Ser Ser Val Ala Lys Leu Asn Ala Leu Arg Lys Ser Gly Gln Phe Cys 2025 30 Asp Val Arg Leu Gln Val Cys Gly His Glu Met Leu Ala His Arg Ala 3540 45 Val Leu Ala Cys Cys Ser Pro Tyr Leu Phe Glu Ile Phe Asn Ser Asp 5055 60 Ser Asp Pro His Gly Ile Ser His Val Lys Phe Asp Asp Leu Asn Pro 6570 75 80 Glu Ala Val Glu Val Leu Leu Asn Tyr Ala Tyr Thr Ala Gln Leu Lys85 90 95 Ala Asp Lys Glu Leu Val Lys Asp Val Tyr Ser Ala Ala Lys Glu Leu100 105 110 Lys Met Asp Arg Val Lys Gln Val Cys Gly Asp Tyr Leu Leu SerArg 115 120 125 Met Asp Val Thr Ser Cys Ile Ser Tyr Arg Asn Phe Ala SerCys Met 130 135 140 Gly Asp Ser His Leu Leu Asn Lys Val Asp Ala Tyr IleGln Glu His 145 150 155 160 Leu Leu Gln Ile Ser Glu Glu Glu Glu Phe LeuLys Leu Pro Arg Leu 165 170 175 Lys Leu Glu Val Met Leu Glu Asp Asn ValCys Leu Pro Ser Asn Gly 180 185 190 Lys Leu Tyr Thr Lys Val Ile Asn TrpVal Gln Arg Ser Ile Trp Glu 195 200 205 Asn Gly Asp Ser Leu Glu Glu LeuMet Glu Glu Val Gln Thr Leu Tyr 210 215 220 Tyr Ser Ala Asp His Lys LeuLeu Asp Gly Asn Leu Leu Asp Gly Gln 225 230 235 240 Ala Glu Val Phe GlySer Asp Asp Asp His Ile Gln Phe Val Gln Lys 245 250 255 Lys Pro Pro ArgGlu Asn Gly His Lys Gln Ile Ser Ser Ser Ser Thr 260 265 270 Gly Cys LeuSer Ser Pro Asn Ala Thr Val Gln Ser Pro Lys His Glu 275 280 285 Trp LysIle Val Ala Ser Glu Lys Thr Ser Asn Asn Thr Tyr Leu Cys 290 295 300 LeuAla Val Leu Asp Gly Ile Phe Cys Val Ile Phe Leu His Gly Arg 305 310 315320 Asn Ser Pro Gln Ser Ser Pro Thr Ser Thr Pro Lys Leu Ser Lys Ser 325330 335 Leu Ser Phe Glu Met Gln Gln Asp Glu Leu Ile Glu Lys Pro Met Ser340 345 350 Pro Met Gln Tyr Ala Arg Ser Gly Leu Gly Thr Ala Glu Met AsnGly 355 360 365 Lys Leu Ile Ala Ala Gly Gly Tyr Asn Arg Glu Glu Cys LeuArg Thr 370 375 380 Val Glu Cys Tyr Asn Pro His Thr Asp His Trp Ser PheLeu Ala Pro 385 390 395 400 Met Arg Thr Pro Arg Ala Arg Phe Gln Met AlaVal Leu Met Gly Gln 405 410 415 Leu Tyr Val Val Gly Gly Ser Asn Gly HisSer Asp Asp Leu Ser Cys 420 425 430 Gly Glu Met Tyr Asp Ser Asn Ile AspAsp Trp Ile Pro Val Pro Glu 435 440 445 Leu Arg Thr Asn Arg Cys Asn AlaGly Val Cys Ala Leu Asn Gly Lys 450 455 460 Leu Tyr Ile Val Gly Gly SerAsp Pro Tyr Gly Gln Lys Gly Leu Lys 465 470 475 480 Asn Cys Asp Val PheAsp Pro Val Thr Lys Leu Trp Thr Ser Cys Ala 485 490 495 Pro Leu Asn IleArg Arg His Gln Ser Ala Val Cys Glu Leu Gly Gly 500 505 510 Tyr Leu TyrIle Ile Gly Gly Ala Glu Ser Trp Asn Cys Leu Asn Thr 515 520 525 Val GluArg Tyr Asn Pro Glu Asn Asn Thr Trp Thr Leu Ile Ala Pro 530 535 540 MetAsn Val Ala Arg Arg Gly Ala Gly Val Ala Val Leu Asn Gly Lys 545 550 555560 Leu Phe Val Cys Gly Gly Phe Asp Gly Ser His Ala Ile Ser Cys Val 565570 575 Glu Met Tyr Asp Pro Thr Arg Asn Glu Trp Lys Met Met Gly His Met580 585 590 Thr Ser Pro Arg Ser Asn Ala Gly Ile Ala Thr Val Gly Asn ThrIle 595 600 605 Tyr Ala Val Glu Asp Ser Met Ala Met Asn Phe 610 615 1647 PRT Homo sapiens 16 Lys Leu Ile Ala Ala Gly Gly Tyr Asn Arg Glu GluCys Leu Arg Thr 1 5 10 15 Val Glu Cys Tyr Asn Pro His Thr Asp His TrpSer Phe Leu Ala Pro 20 25 30 Met Arg Thr Pro Arg Ala Arg Phe Gln Met AlaVal Leu Met Gly 35 40 45 17 48 PRT Homo sapiens 17 Gln Leu Tyr Val ValGly Gly Ser Asn Gly His Ser Asp Asp Leu Ser 1 5 10 15 Cys Gly Glu MetTyr Asp Ser Asn Ile Asp Asp Trp Ile Pro Val Pro 20 25 30 Glu Leu Arg ThrAsn Arg Cys Asn Ala Gly Val Cys Ala Leu Asn Gly 35 40 45 18 49 PRT Homosapiens 18 Lys Leu Tyr Ile Val Gly Gly Ser Asp Pro Tyr Gly Gln Lys GlyLeu 1 5 10 15 Lys Asn Cys Asp Val Phe Asp Pro Val Thr Lys Leu Trp ThrSer Cys 20 25 30 Ala Pro Leu Asn Ile Arg Arg His Gln Ser Ala Val Cys GluLeu Gly 35 40 45 Gly 19 47 PRT Homo sapiens 19 Tyr Leu Tyr Ile Ile GlyGly Ala Glu Ser Trp Asn Cys Leu Asn Thr 1 5 10 15 Val Glu Arg Tyr AsnPro Glu Asn Asn Thr Trp Thr Leu Ile Ala Pro 20 25 30 Met Asn Val Ala ArgArg Gly Ala Gly Val Ala Val Leu Asn Gly 35 40 45 20 47 PRT Homo sapiens20 Lys Leu Phe Val Cys Gly Gly Phe Asp Gly Ser His Ala Ile Ser Cys 1 510 15 Val Glu Met Tyr Asp Pro Thr Arg Asn Glu Trp Lys Met Met Gly His 2025 30 Met Thr Ser Pro Arg Ser Asn Ala Gly Ile Ala Thr Val Gly Asn 35 4045 21 22 PRT Homo sapiens 21 Lys Leu Tyr Gly Gly Leu Val Glu Tyr Asp ProTrp Ala Pro Met Arg 1 5 10 15 Ala Gly Val Ala Leu Gly 20 22 19 PRT Homosapiens 22 Ala Ser Gln Gly Thr Lys Arg Ser Tyr Glu Gln Met Glu Thr AspGly 1 5 10 15 Glu Arg Gln 23 8 PRT Homo sapiens SITE (5)...(7)Xaa=unknown amino acid residue 23 Thr Lys Arg Ser Xaa Xaa Xaa Met 1 5 2419 DNA Artificial Sequence oligonucleotide 24 gcaaagcagg agaaaccac 19 2524 DNA Artificial Sequence primer 25 gggtccatct gatagatatg agag 24 26 48DNA/RNA Artificial Sequence primer 26 cuacuacuac uaggccacgc gtcgactactacgggnnggg nngggnng 48 27 20 DNA Artificial Sequence oligonucleotide 27tcctgatgtt gctgtagacg 20 28 20 DNA Artificial Sequence oligonucleotide28 gcacgactag tatgatttgc 20 29 8 PRT Homo sapiens 29 Thr Gly Ala Gly AlaGly Cys Gly 1 5 30 5 PRT Homo sapiens 30 Tyr Ser Ala Ala Lys 1 5 31 20DNA Artificial Sequence oligonucleotide 31 cattcctctc tgttatagcc 20 3217 DNA Artificial Sequence primer 32 ccacctgcag ctatgag 17 33 14 PRT VSV33 Lys Leu Gly Ile Ala Pro Pro Pro Tyr Glu Glu Asp Thr Ser 1 5 10 34 14PRT VSV 34 Lys Lys Met Gly Leu Pro Pro Pro Tyr Asp Glu Ser Cys Pro 1 510 35 14 PRT Rabies virus 35 Asp Leu Trp Leu Pro Pro Pro Glu Tyr Val ProLeu Lys Glu 1 5 10 36 14 PRT PIRY 36 Met Glu Trp Glu Ser Pro Pro Ser TyrAsn Glu Ile Lys Ser 1 5 10 37 14 PRT Carp virus 37 Lys Ser Lys Gly ThrPro Pro Thr Tyr Glu Glu Thr Leu Ala 1 5 10 38 14 PRT Ebola virus 38 IleLeu Pro Thr Ala Pro Pro Glu Tyr Met Glu Ala Ile Tyr 1 5 10 39 14 PRTMarburg virus 39 Met Gln Tyr Leu Asn Pro Pro Pro Tyr Ala Asp His Gly Ala1 5 10 40 11 PRT Homo sapiens SITE (2) Xaa=unknown amino acid residue 40Ser Xaa Gly Thr Lys Arg Ser Tyr Xaa Xaa Met 1 5 10 41 8 PRT Homo sapiensSITE (5)...(7) Xaa=unknown amino acid residue 41 Thr Lys Arg Ser Xaa XaaXaa Met 1 5 42 11 PRT Homo sapiens 42 Cys Gly Asp Leu Arg Leu Thr LeuLeu Glu Leu 1 5 10

What is claimed is:
 1. A method for identifying a test compound thatinhibits the interaction between an NS1-BP protein and an influenzavirus NS1 protein, said method comprising: a) contacting an NS1-BPprotein or a peptide fragment of the NS1-BP protein comprising thebinding site for an NS1 protein with an immobilized NS1 protein or apeptide fragment of the NS1 protein comprsing the binding site for anNS1-BP protein, under conditions and for a time sufficient to pemit theNS1-BP protein or NS1-BP peptide fragment to bind to and form a complexwith the NS1 protein or NS1 peptide fragment; b) removing unbound NS1-BPprotein or NS1-BP peptide fragment; and c) detecting the complex, so ifthere is a decrease in the complex detected in c) as compared to theamount of complex detected in the absence of the test compound, a testcompound that inhibits the interaction between an NS1-BP protein and anNS1 protein is identified.
 2. A method for identifying a test compoundthat inhibits the interaction between an NS1-BP protein and an influenzavirus NS1 protein, said method comprising: a) contacting an NS1 proteinor a peptide fragment of the NS1 protein comprising the binding site forthe NS1-BP protein with an immobilized NS1-BP protein or a peptidefragment of the NS1-BP protein comprising the binding site for the NS1protein, under conditions and for a time sufficient to permit the NS1protein or NS1 peptide fragment to bind to and form a complex with theNS1-BP protein or NS1-BP peptide fragment, in the presence of testcompound; b) removing unbound NS1 protein or NS1 peptide fragment; andc) detecting the complex, so if there is a decrease in the complexdetected in c) as compared to the amount of complex detected in theabsence of the test compound, a test compound that inhibits theinteraction between an NS1-BP protein and an NS1 protein is indentified.3. A method for identifying a test compound that inhibits theinteraction between an NS1-BP protein and an influenza virus NS1protein, said method comprising: a) contacting a fusion protein with animmobilized NS1 protein or a peptide fragment of the NS1 proteincomprising the binding site of the NS1 protein, under conditions and fora time sufficient to permit the fusion protein to bind to and form acomplex with the NS1 protein or NS1 peptide fragment, in the presence oftest compound, wherein the fusion protein comprises an NS1-BP protein ora peptide fragment of the NS1-BP protein comprising the binding site forNS1 protein; b) removing unbound fusion protein; and c) detecting thecomplex, so if there is a decrease in the complex detected in c) ascompared to the amount of complex detected in the absence of the testcompound, a test compound that inhibits the interaction between anNS1-BP protein and an NS1 protein is identified.
 4. A method foridentifying a test compound that inhibits the interaction between anNS1-BP protein and an influenza virus NS1 protein, said methodcomprising: a) contacting a fusion protein with an immobilized NS1-BPprotein or a peptide fragment of the NS1-BP protein comprising thebinding site of the NS1 protein, under conditions and for a timesufficient to permit the fusion protein to bind to and form a complexwith the NS1-BP protein or NS1-BP peptide fragment, in the presence oftest compound, wherein the fusion protein comprises an NS1 protein or apeptide fragment of the NS1 protein comprising the binding site forNS1-BP protein; b) removing unbound fusion protein; and c) detecting thecomplex, so if there is a decrease in the complex detected in c) ascompared to the amount of complex detected in the absence of the testcompound, a test compound that inhibits the interaction between anNS1-BP protein and an NS1 protein is identified. identified.
 5. A methodfor identifying a test compound that disrupts the interaction between anNS1-BP protein and an influenza virus NS1 protein, said methodcomprising: a) contacting an NS1-BP/NS1 protein complex with a testcompound, wherein the NS1-BP/NS1 protein complex comprises an NS1-BPprotein or a peptide fragment of the NS1-BP protein comprising thebinding site for an NS1 protein, and an NS1 protein or a peptidefragment of the NS1 protein comprising the binding site for an NS1-BPprotein; and b) detecting the NS1-BP/NS1 protein complex, so that ifthere is a decrease in the complex detected in b) as compared to theamount of complex detected in the absence of the test compound, a testcompound that disrupts the interaction between an NS1-BP protein and anNS1 protein is identified.
 6. A method for identifying a test compoundthat inhibits the interaction between an NS1-BP protein and an influenzavirus NS1 protein, said method comprising: a) contacting, in a reactionmixture, an NS1 protein or a peptide fragment of the NS1 proteincomprising the binding site for an NS1-BP protein with an NS1-BP proteinor a peptide fragment of the NS1-BP protein comprising the binding sitefor an NS1 protein, under conditions and for a time sufficient to permitthe NS1 protein or NS1 peptide fragment to bind to and form a complexwith the NS1-BP protein or NS1-BP peptide fragment, in the presence oftest compound; b) separating the complex from the reaction mixture; andc) detecting the complex, so if there is a decrease in the complexdetected in c) as compared to the amount of complex detected in theabsence of the test compound, a test compound that inhibits theinteraction between an NS1-BP protein and an NS1 protein is identified.7. A method for identifying a test compound that inhibits theinteraction between an NS1-BP protein and an influenza virus NS1protein, said method comprising: a) contacting an NS1 protein or apeptide fragment of the NS1 protein comprising the binding site for anNS1-BP protein with an immobilized fusion protein, under conditions andfor a time sufficient to permit the NS1 protein or NS1 peptide fragmentto bind to and form a complex with the fusion protein, in the presenceof test compound, wherein the fusion protein comprises an NS1-BP proteinor a peptide fragment of the NS1-BP protein comprising the binding sitefor an NS 1 protein; b) removing unbound NS1 protein or NS1 peptidefragment; and c) detecting the complex, so if there is a decrease in thecomplex detected in c) as compared to the amount of complex detected inthe absence of the test compound, a test compound that inhibits theinteraction between an NS1-BP protein and an NS1 protein is identified.8. A method for identifying a test compound that inhibits theinteraction between an NS1-BP protein and an influenza virus NS1protein, said method comprising: a) contacting an NS1-BP protein or apeptide fragment of the NS1-BP protein comprising the binding site foran NS1 protein with an immobilized fusion protein, under conditions andfor a time sufficient to permit the NS1-BP protein or NS1-BP peptidefragment to bind to and form a complex with the fusion protein, in thepresence of test compound, wherein the fusion protein comprises an NS1protein or a peptide fragment of the NS1 protein comprising the bindingsite for an NS1-BP protein; b) removing unbound NS1-BP protein or NS1-BPpeptide fragment; and c) detecting the complex, so if there is adecrease in the complex detected in c) as compared to the amount ofcomplex detected in the absence of the test compound, a test compoundthat inhibits the interaction between an NS1-BP protein and an NS1protein is identified.
 9. A method for identifying a test compound thatinhibits the interaction between an NS1-BP protein and an influenzavirus NS1 protein, said method comprising: a) contacting, in a reactionmixture, an NS1-BP protein or a peptide fragment of the NS1-BP proteincomprising the binding site for an NS1 protein with a fusion protein,under conditions and for a time sufficient to permit the NS1-BP proteinor NS1-BP peptide fragment to bind to and form a complex with the fusionprotein, in the presence of test compound, wherein the fusion proteincomprises an NS1 protein or a peptide fragment of the NS1 proteincomprising the binding site for an NS1-BP protein; b) separating thecomplex from the reaction mixture; and c) detecting the complex, so ifthere is a decrease in the complex detected in c) as compared to theamount of complex detected in the absence of the test compound, a testcompound that inhibits the interaction between an NS1-BP protein and anNS1 protein is identified.
 10. A method for identifying a test compoundthat inhibits the interaction between an NS1-BP protein and an influenzavirus NS1 protein, said method comprising: a) contacting, in a reactionmixture, an NS1 protein or a peptide fragment of the NS1 proteincomprising the binding site for an NS1-BP protein with a fusion protein,under conditions and for a time sufficient to permit the NS1 protein orNS1 peptide fragment to bind to the fusion protein and form a complexwith, in the presence of test compound, wherein the fusion proteincomprises an NS1-BP protein or a peptide fragment of the NS1-BP proteincomprising the binding site for an NS1 protein; b) separating thecomplex from the reaction mixture; and c) detecting the complex, so ifthere is a decrease in the complex detected in c) as compared to theamount of complex detected in the absence of the test compound, a testcompound that inhibits the interaction between an NS1-BP protein and anNS1 protein is identified.
 11. The method of claim 1 or 3, wherein theNS1 protein or NS1 peptide fragment is immobilized on a solid surface.12. The method of claim 1, 6, 8, or 9, wherein the NS1-BP protein orNS1-BP peptide fragment is directly or indirectly labeled.
 13. Themethod of claim 2 or 4, wherein the NS1-BP protein or NS1-BP peptidefragment is immobilized on a solid surface.
 14. The method of claim 2,6, 7, or 10, wherein the NS1 protein or NS1 peptide fragment is directlyor indirectly labeled.
 15. The method of claim 12, wherein the label isa radioisotope or an enzymatic label.
 16. The method of claim 14,wherein the label is a radioisotope or an enzymatic label.
 17. Themethod of claim 7 or 8, wherein the fusion protein is immobilized on asolid surface.
 18. The method of claim 3, 4, 9, or 10, wherein thefusion protein is directly or indirectly labeled.
 19. The method ofclaim 18, wherein the label is a radioisotope or an enzymatic label. 20.The method as in any one of claims 1-10, wheren the test compound is apeptide, antibody, or organic molecule.
 21. The method of claim 7 or 8,wherein the fusion protein further comprises glutathione-S-transferase.